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PREFACE 


During the last few decades the physical sciences have advanced with 
seven-league boots, It would be difficult to name a single branch of human 
knowledge that has not been affected significantly, or at times even radically, 
as a result of the exceptional achievements of physics. Modern engineering is 
to a large extent the creation of physics. New ideas and facts discovered by 
physicists become an integral part of man’s knowledge of the surrounding 
world. The problem of the structure of matter, whose investigation previously 
appeared as a peculiar form of mental exercise, has come to the forefront in 
science as a most important task that is inseparably linked with the develop- 
ment of civilisation. 

A broad knowledge of physics is a necessity for the specialist working in 
any branch of science or engineering if he desires to comprehend the fundamen- 
tals of his field of knowledge and is striving to take a creative part in its 
development. The task of a course in physics for students of a technical insti- 
tute consists, therefore, in helping them to understand the physical basis of 
engineering. 

In addition to this main task, a course in physics in a technical institute 
should be organised in such a manner as to help the student to master experi- 
mental techrique and acquaint him with equipment used to measure physical 
quantities. Skill in experimental physics is attained by working in the labora- 
tory. It seems to us that familiarising oneself with experimental physics is 
a completely distinct task in the study of physics in technica: institutes. The 
interweaving of experimental physics with the study of general physical laws 
and phenomena is only occasionally pedagogically justified. This is due to the 
fact that modern experimental physics cannot be sharply subdivided. TRe meas- 
urement of coefficients of expansion is accomplished with the aid of interfero- 
metry, radio equipment is required for experiments in mechanics and heat, and 
the investigation of the structure of metals is inseparably linked with experi- 
ments in electricity. Physical experiments conducted with the aid of outmoded 
techniques are of interest only to specialists in physics desiring to trace the 
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development of one or another experiment. It would probably be most expedient 
to arrange the curriculum in such a manner that laboratory work followed 
a course in general physics. 

Thus, the author believes that lectures in physics, and consequently the 
corresponding textbook, should include only outlines of experiments, i.e., the 
goal of the experiments. 

Once agreed on the necessity for excluding experimental physics from our 
course, we must then choose between the inductive approach (from particular 
experimental facts to theoretical generalities) and the deductive approach 
(from theory to its experimental corroboration and manifold applications). In 
a very extensive course, it is probably possible to combine these two approaches 
as they are linked in the development of science. This possibility was not open 
to the author and so the second approach was chosen. Presentation of the basic 
theoretical propositions, the deduction of corollaries that could be verified 
experimentally, and then the illustration of these experiments by means of 
diagrams—this was the approach adopted in practically every chapter of this 
book. Naturally, this meant that the historical method had to be completely 
disregarded. The history of the origin of ideas, the formulation and discard 
of physical theories, remained beyond the scope of this book, since it is 
written for the student who is not aiming to be a_ professional physicist. 
It seems to me that only such a method of presentation makes, for clarity 
and conciseness. 

This approach is also dictated by the structure of the course. If the funda- 
mental laws of physics are regarded as of paramount importance and we care- 
fully consider. the body of facts pertaining to a particular complex of theoret- 
ical ideas, then it would be expedient to separate the presentation of the 
strictly phenomenological theories and their corollaries from the problems relat- 
ing to the structure of matter. Classical mechanics, thermodynamics, statistical 
physics and electrodynamics are self-contained and to a large extent in them- 
selves complete fields of physics. Their presentation should precede the considera- 
tion of the problems relating to the structure of matter. It is expedient to con- 
sider the connection between physical properties and the structure of matter 
after the fundamentals of classical physics have been presented and after the 
theories of the structure of matter have been discussed. 

In many books on physics, problems related to the structure of matter and 
problems connecting the properties of matter with its structure are scattered 
through various parts of the course and to a large extent are lost, i.e., dis- 
persed in other topics. It seems tome that this is not justified both from the point 
of view of the logical structure of the course as well as from the viewpoint of 
the importance of these problems, especially if it is borne in mind that many 
institutions of higher education teach chemistry and technology, and specialise 


Preface 15 





in matter and materials. The attention of the reader is called to the fact that 
problems related to the structure of matter are discussed in a separate, conclud- 
ing section. This is clearly the basic difference, as regards presentation, 
between this book and others. 

Another distinguishing feature of this textbook on physics is the integrated 
presentation of problems in radio physics and optics. This helps to give the 
reader a better understanding of the essence of physical phenomena. : 

In view of the relatively short time allotted to a course in physics, one is 
forced to seek methods of presentation that are as condensed and concise as 
possible. At the same time, it is impermissible to treat superficially a number 
of problems in physics, the knowledge of which is a sine qua non today for 
every qualified chemist, technologist and engineer. The reconciliation of these 
two contradictory requirements placed the author in a difficult situation. He 
decided, finally, to completely exclude repetition of all secondary-school material. 
However, this measure alone was insufficient to make room in this single vclume 
for all the important problems of physics. To further reduce the size of the book, 
a concise presentation of the material was necessary. Some people feel that lectures 
and textbooks should include selected problems which are exhaustively dis- 
cussed, and all that cannot be presented historically and logically in complete 
detail should be excluded from the physics course, Such a viewpoint, it seems 
to me, is basically incorrect. A detailed knowledge of certain topics is not 
essential for the future engineer. At the same time, however, he must have 
a general notion of these topics, if only to be aware of the existence of these 
phenomena and laws. Therefore, on various occasions only the theoretical con- 
clusions have been presented and our knowledge in a number of branches of 
science is merely summarised, i.e., a detailed description of the manner in 
which this knowledge was obtained has not been given despite its interesting 
and educational nature. 

At first glance, it may appear that the author, overlooking the fact that 
mathematics and physics generally are taught concurrently, has not employed 
mathematics wisely. Thus, the definite integral symbol appears in our book 
before the student has studied the integral calculus. However, an understanding 
of the text does not require that the student be able to integrate, but that he 
merely be familiar with the concept of the definite integral. It would be in- 
advisable and tumbersome to avoid such formulas in a physics textbook, and it 
would result in an impoverished presentation of the physical concepts invol- 
ved. A more profitable approach is to introduce some minor changes in the 
curriculum with respect to mathematics, so that the students are initiated into 
the concepts of mathematical analysis early in the course. This suffices for they 
student to be able to read books using these symbols. The process of integra- 
tion may be studied considerably after the relatively easy concept of the defi- 
nite integral has been assimilated. 

The interconnection between the various branches of physics has its effect on 
the presentation of theory and experimental results. If the lecturer desires to 
maintain logical presentation in the course, he must inevitably resort to repe- 
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tition and cross-reference. There are innumerable examples of this. Thus, on 
the one hand, it is not desirable to separate the presentation of X-ray diffrac- 
tion from optical diffraction. On the other hand, it is impermissible to disasso- 
ciate the latter problem from the problems of crystal structure. Or, as another 
example, it is difficult to treat problems of permittivity and dipole moments 
in separate sections, but at the same time it is not possible to discuss dipole 
moments without discussing other problems related to the structure of molecules. 
Furthermore, permittivity and coefficients of refraction must be discussed. 
together. There is, of course, only one solution: in lectures, repetition is essen- 
tial (very useful, incidentally, for students); and in the textbook, recourse 
must be had to cross-reference. Owing to cross-referencing, a physics textbook 
cannot be read a single time in consecutive order. This also means that in order 
to properly understand one portion of the physics field, familiarity with the field 
as a whole is generally necessary. 


A. Kitaigorodsky 


PART ONE 
MECHANICAL AND THERMAL MOTION 





CHAPTERI 
THE FUNDAMENTAL LAW OF MECHANICS 


1. Kinematics 


Equations of Motion of a Particle. If the dimensions and shape 
of a body are of no consequence in the consideration of a partic- 
ular phenomenon, we can conceive of the body as being represent- 
ed by a point. This approximate representation of a body by 
a material (i.e., mass) point is not only justified when the dimen- 


OG 





Fig. 1 


sions of the body are small relative to other distances considered 
in the problem, but is permissible whenever we are only interested 
in the motion of the centre of mass of the body. 

‘In order to describe the motion of a particle, one must indi 
cate through which points in space the particle has passed and 
the instants of time during which it was located at one or another 
point of the path. For this purpose, it is necessary, in the first 
place, to select a coordinate frame of reference (Fig. 1). The loca- 
tion of a point in such a coordinate system, which in its simplest 
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form is right-angled, is determined by the three coordinates x, y, 
z, or by the so-called radius vector r, drawn from the origin of 
the. coordinate system to the given point* (Fig. 2). 

Thus, motion in space can be roughly described in the form 
of a table of values for r (each value being given by three quan- 
tities!) for the instants of time ¢,, ¢,, etc.; or accurately de- 
scribed in the form of a continuous function r =f (f) (in essence, three 





Fig. 2 Fig. 3 


functions, e.g., x=/f,(, y=f, (4), z=f, (4); or r=9Q, (), a=@,{?), 
B=¢, (2); etc.). 

The vector equation r=/(t) or, what amounts to the same, the 
three equivalent scalar equations are called the equations of 
motion. 

Average Velocity. Let us consider AB, a portion of the path. 
Assume that at the instant of time ¢ the moving particle was 
at A, and at the instant of time ¢-+-At at B (Fig. 3). Let us 
introduce the radius vectors r, and rz. We know that during the 
interval of time At, the particle moved from A to B. It is there- 


fore natural to call the vector AB the particle displacement vector. 
Vectors may be added by the parallelogram method. From 
Fig. 3, we see that 


rp=r,+A4B or AB =r,—r,=Ar, 


i.e., the particle displacement vector is the vector difference of 
the radius vectors. The curvilinear motion is determined by the 


* The radius vector r is given by its magnitude, r=V<+y?+2%, and the 


angles it forms with the coordinate axes: cosa=—, cos pat and cos y=. 
Thus, it is determined by three quantities: x, y and z; or r, a and B; or r, a 
and y; etc. (two angles determine the third, since cos? a -}+cos* B + cos? y= 1). 
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displacement vector Ar for time At, whereby the smaller Ar, the 
greater the accuracy. 
The average speed for the path AB is given by the relation 
AB 


Yaw Are 


This is the speed at which the body would have traversed the 
distance AB in uniform and rectilinear motion during the interval 
of time Af. 

Thus, motion over the path AB may be specified by giving 


the direction of the vector AB=Ar and the speed Vay In place 
of this, we introduce the vector 


=e 
AB_ Ar 


Pav = TE = Ke? 


which is equal in magnitude to the average speed and whose 
direction is that of the displacement vector. We can now:say that 
the motion of the body over the path AB is determined by the 
average velocity. 

Instantaneous Velocity. If we decrease the interval of time Af, 
the point B will approach point A. These points finally merge 


and the direction of AB then coincides with the tangent to the 
curve at the point of merger. 


As At decreases, the ratio a approaches a limit. The vector 


Vinst) Having the direction of the tangent to the curve at the 
given moment of motion and numerically equal to the limit of 


the ratio “2 as At—+0, is called the instantaneous particle 


At 
velocity: 
V inst = limit ar when At—0. 


In other words, the instantaneous velocity is the derivative of the 
vector r with respect to time: , 
; dr 

Mas 
It should again be emphasised that it is not absolutely essential 
to employ vectors in order to describe motion. Instead of usMhg 
the concept of vector velocity, we could speak of the absolute 


value of the velocity, ler ,* and indicate the direction of motion. 


* The vertical bars | | indicate that only the absolute value (modulus) of 
‘the vector between the bars is being considered. 


- 
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If we did this, however, the same rules and the same experimental 
facts would require more cumbersome and more wordy formula- 
tions. Vector notation corresponds to physical experience, and is 
moreover concise and expressive. A certain amount of effort, how- 
ever, is required to become accustomed to it. 

Since the projections of the vector r on the coordinate axes 
are the coordinates of its. terminus, x, y and z, the projections 
of the velocity vector are: 

dx _ ay 
a= ap? Oy at 


d 
and v= 5: 


v 

Acceleration. To continue our consideration of curvilinear motion, 
let us draw arrows to represent the instantaneous velocities of the 
body in passing through the points A and B of its path. If we had 
not introduced the concept of velocity, we would have to describe 
the situation as follows: the speed at B is different from that at A; 
moreover, the direction of motion has changed. Using the concept 
of velocity, we can state more briefly: the velocity at B is differ- 
ent from that at A. 

Velocity can change in magnitude and direction. 

If the path AB is rectilinear, the vectors v7, and %, have the 
same direction. The change in velocity is obtained by arithmeti- 
cally subtracting the magnitude of the vector v, from the magni- 
tude of the vector d,. 

Let us now consider the curvilinear path AB; vectors v, and v, 

differ in magnitude as well 
y, as in direction. To deter- 

mine theincrease inthemag- 
ay nitude of the velocity, it is 
necessary, as before, to sub- 
tract the magnitude of the 
V, vector v, from the magni- 
tude of the vector d,: 
A|9|=|%3|—|9,|- 
However, this quantity does 
not, of course, completely 
express the change that has 
occurred in the motion. 

Let us now subtract vec- 
tor %, from vector ¥, in 
accordance with the laws 
for operating on vectors. 
Fig. 4 shows vector 





Fig. 4 AV=VUp—V,q.- 
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Vector ¥,, the sum of Av-+v,, is the diagonal.of the parallelo- 
gram constructed on these vectors. 

Vector Av is called the velocity increment. The magnitude of 
this vector in the case of curvilinear motion is not A]v|=|%9,|/— 
—|v,|. From the figure, it is evident that the magnitude of the 
increment vector |Av| is greater than A|o|, the difference in the 
magnitudes of the velocities. To determine the velocity at point B, 
one must add velocity v, and increment Av by the parallelogram 
method. 

We can now determine the acceleration for curvilinear motion 
as follows. The ratio of the velocity increment to the interval of 
time during which this increment takes place is called average 
acceleration: ‘ 

x) 


ag= Ai’ 


When the interval of time Af is decreased, this ratio approaches 
a limit. The vector 


Ging = limit $? when At—+0 


is called the instantaneous acceleration of a body at a given mo- 

ment of motion. In other words, acceleration is the derivative of 
velocity: 
dv 

a= 


and 
Ge Gp y= a a aE: 


The acceleration vector uniquely determines the nature of the 
change in the velocity of the body. 

Generally speaking, the acceleration vector can form any angle 
with the curve. This angle determines the nature of the acceleration 
and the curvature of the path as follows. Through the point of the 
curve that is being considered, a circle is drawn that has a common 
tangent with the path of motion at this point, and for the given 
portion of the curve most accurately approximates it. This circle 
is called a tangential circle* and its radius @ is called the radius 
of curvature at the given point. The acceleration vector is always 
directed into this circle. If the motion is accelerated, the vector a 
forms an acute angle with the curve (i.e., with the tangent to the 
path at the given point). If the motion is retarded, this angle will 


* The tangential circle and the calculation of radius of curvature is studied 
in detail in courses on differential geometry. 
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be obtuse. Finally, if the magnitude of the velocity does not change, 
ihe acceleration vector is directed normal to the curve. 

These statements can be proved rigorously, but we shall merely 
illustrate them geometrically here (see Fig. 5). 

In line with the above discussion, it is customary to resolve the 
acceleration vector into two components (Fig. 6): 


a=aQ, +4, 
Since the vector triangle is right-angled, 
a= ay +n. 
The vector a;, directed along the curve, represents the change in 
the magnitude of the velocity and is called the tangential acceler- 
ation. It is not difficult to 


V, Ves x a show that the tangential 
<5 acceleration 
a,= limit ale when 


At—0, i.e., a,= el, 








dav ; 
where A|v| is the incre- 
Vo>M ment in| fie magnitude of 
the velocity. 

The vector a,, directed 
normal to the curve, repre- 
sents the change in the di- 
rection of the velocity and 
is called the normal acceler- 
ation. The normal acceler- 
ation a, is related by a 
simple formula to the speed 





dav V,=V, 


Fig. 5 Fig. 6 
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v and the radius of curvature @ at the given point, namely: 


v2 
ae ee 


From this formula, which is derived in courses in theoretical mechan- 
ics on the basis of geometrical considerations, it follows that mo- 
tion with a constant normal acceleration (a, and v constant quan- 
tities) is circular motion. In this case, @ is a constant quantity 
ay points along the path and is equal to the radius of the 
circle. 


The normal acceleration a, = mis often also called centripetal 


acceleration. 

Centripetal acceleration of a body moving in acircle with radius 
R can also be expressed by means of the period 7, the frequency v, 
or the angular velocity w of this motion. Between these quantities 
and the linear velocity v, the following simple relations exist: 


tb 


2nR 1 


It 
> v= oR, ver and o= 


v= Tv . 

The last two formulas define the auxiliary quantities v and o. 
Thus, the centripetal acceleration for the motion of a body in a 

circle can also be written in the form: 


2 
a,=o'R or a,= eR. 


It should be emphasised that the everyday understanding of the 
word “acceleration” is much more limited than in physics. The con- 
cept of acceleration in physics includes retardation (negative accel- 
eration) and, what is most important, includes uniform motion if 
this motion is along a curved path. Only motion that is simulta- 
neously rectilinear and uniform is considered motion without accel- 
eration. 


Examples of acceleration. A proton in a modern accelerator moves inacircle 
with a normal acceleration of the order. of 10'® m/sec?. The linear acceleration 
of a modern rocket is ~ 30 m/sec®. The acceleration of a hockey ball is ~ 10 
m/sec®. The initial acceleration of an automobile is 1-2 m/sec?. The angular 
velocity of the rotor of a turbogenerator is 314 rads/sec and, at a distance of 
0.5 metre from the axis of rotation, particles move with an acceleration of 
~5x10* m/sec2. The angular velocity of a bicycle wheel is 7-10 rads/sec and, 
at a radius of 0.5 metre, particles on the rim have a normal acceleration of 
about 20 m/sec? 
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2. Force 


At the present time, three types of interaction are known to the 
physicist. The laws of gravitational and electromagnetic forces 
have been thoroughly investigated, and nuclear forces are being 
intensely studied. 

Gravitational Force. The force of attraction between heavenly 
bodies, which was discovered by Newton and is otherwise known 
as gravitational force, acts between any two particles in accordance 
with the law 


mm, 


F=y nr? 





x10" dynexcm?/gm’, m, and m, are the masses of 
the particles, and r is the distance between them. 

It can be rigorously proved, but we shall not do so here, that 
Newton’s law of gravitation written in the form valid for bodies 
having small dimensions (small with respect to the distance between 
them) is also valid for the interaction of a small body with a 
large sphere. The distance, here, is understood to be measured be- 
tween the centres of the bodies. 

The law of universal gravitation for the case of the attraction 
of a body by the Earth can, therefore, be written in the form 


where y= 


M 
P=) Rea 

where A is the height above the Earth’s surface and R is the radius 

of the Earth. For points close to the Earth’s surface, # is so much 


smaller than R that R-++-A may be replaced by R. Then, Faypum. 


Comparing this formula with the usual expression for weight, 
F = mg, we see that the gravitational acceleration may be expressed 
in terms of the gravitational constant, the mass of the Earth, and 
the radius of the Earth: 


Since the gravitational force is proportional to the masses, it 
is very large ‘for heavenly bodies and negligibly small for the ele- 
mentary particles. In the interaction between atoms, molecules and 
Other particles of matter, the gravitational force is of no signifi- 
canice. 

The force of attraction between the Moon and the Earth is 
2.3 10%§ dynes. between the Earth and a molecule of oxygen 
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~ 5x 107?° dyne, and between two oxygen molecules that are touching 
each other (3A=3x107* cm), ~2x107*’ dyne. These figures speak 
for themselves. 

Electromagnetic Force. If two particles or bodies have electric 
charges q, and q,, there is a force of attraction between them if 
the charges are of opposite sign and a force of repulsion if the 
charges are of equal sign. Quantitatively this relationship is expressed 


by Coulomb’s law: F=%2, As in the case of universal gravi- 


tation, this formula is valid for small particles. We shall show 
in Sec. 111 that magnetic force and electric force are intimately 
related. All electromagnetic interaction is of a single nature. 

The interaction between atoms, intermolecular forces, and the 
forces holding electrons about an atomic nucleus are all forces of 
electrical origin. In order to again demonstrate the negligible na- 
ture of the gravitational interaction between elementary particles, 
we compare gravitational attraction with the electric attraction 
between a hydrogen atom’s nucleus and its single electron: 


F,,=9X10-* dyne, while F,,¢9= 4X 10~" dyne! 


At first glance, it may not appear understandable why the inter- 
action between neutral atoms and molecules is of electrical origin. 
We shall go into this in more detail in Chapter 29. However, we 
should note here that the forces between the atoms and molecules 
co not depend on the overall charge of the particles (which is equal 
to zero), but on the local ccncentration of electric charge. 

Since intermolecular force is of electrical origin, surface tension 
and all cohesion forces between bodies are of the same origin. 
Frictional force is also essentially based on electric interaction. 

The elastic force that is developed when rubber or a compressed 
metal spring is extended is due to interatomic and intermolecular 
interaction. 

Thus, it too, in the final analysis, is electromagnetic in nature. 

Nuclear Force. There are forces between neutral particles in an 
atomic nucleus (also between a neutron and a proton and between 
two protons) that cannot be explained on the basis of electromag- 
netism. These forces decrease very rapidly with increasing distance 
between interacting particles. As a result, these forces do not exist 
beyond the bounds of nuclei and are evident only in connection 
with phenomena involving direct interaction of nuclei. 

Force Field. The space in which gravitational force is effective 
is called a gravitational field. Similarly, we speak of an electro- 
magnetic field. Any particle acted on by a force field can also create 
such a field. Thus, every particle creates a gravitational field and 
is acted on by gravity; and every electrically charged particle 
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creates an electromagnetic field and is acted on by an electromag- 
netic field. 

Thus, every interaction of particles is depicted in physics accord- 
ing to the scheme: particle—field—particle. The first particle 
creates a field, and this field acts on the second particle. 

The field created by particles is material. The properties of a 
field are essentially different from the properties of substance. As a 
result, it is often said today that matter has two forms—field and 
substance. The problems of the interrelation of field and substance 
are at present under intense investigation and cannot, as yet, be 
considered solved (see p. 552 for a more detailed discussion). 


3. The Fundamental Law of Mechanics 


Newton’s Laws. The fundamental law .of mechanics is the rela- 
tionship found by Newton between the forces acting on a body and 
the acceleration acquired by the bedy under the action of these 
forces. This law is usually formulated for particles. This in no way 
limits the universality of the law, inasmuch as a complex body 
can be considered, in principle, as the sum total of all its parti- 
cles. Moreover, Newton’s equation has extraordinarily broad direct 
application, since in most problems in mechanics we are either 
concerned with bodies having small dimensions or are interested 
only in the motion of the body’s centre of mass. 

The fundamental law of mechanics states the following. If the 
forces f,, f,, f,, etc., whose sum total is F=f, act on a body, 
the acceleration acquired by the body is equal to the quotient 
eal by dividing the resultant force by the mass of the par- 
ticle: 


a= 


3|% 


The equation also states that the acceleration vector coincides with 
the direction of the resultant force. The constant of proportionality 
in this formula is assumed to be equal to unity, which, the stu- 
dent will recall from his earlier training, depends on the choice 
of the system of units for the quantities entering into this equation. 

The fundamental law of mechanics may also be written in the form 


do 
F=m7z 


or F SR) The latter equation is equivalent to the former only 


if the mass does not change during the motion. We shall adhere 
to this condition. The case of variable mass will be considered 
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below. In Chapter 3, we shall briefly discuss the equation of mo- 
tion for bodies of variable mass in the range typical for rockets 
and, in Chapter 24, we shall consider the complications arising 
when a body moves with a speed approaching that of light (mechan- 
ics of the theory of relativity). 

The fundamental law of mechanics should be considered as a law 
generalising observed facts. This equation cannot be theoretically 
derived from any simple general considerations. 

The law of inertia follows directly from the fundamental law. 
If there are no forces acting on the body, the acceleration is equal 
to zero and the motion of the body is rectilinear and uniform. 

In applying Newton’s fundamental law to a particular body, we 
focus our attention on this body and consider the forces acting on it. 
It should not be forgotten, however, that force is-a measure of the 
interaction between bodies and that one-sided interaction does 
not exist. If one body acts on another; the latter also acts on the 
former. The measurement of force is equivalent to the measurement 
of interaction. Thus, the very method of measuring force assumes 
that the force of one body acting on another and the force exerted 
by the latter on the former are equivalent in magnitude. Since we 
are usually interested in one particular body, we focus our atten- 
tion on the force acting on it; the other force is called the force 
of counteraction or the force of reaction. The forces of action and 
reaction are equal in magnitude but are oppositely directed. This 
proposition has become known as Newton’s third law of motion. 

Relativity of Motion. A body at rest in one system of coordi- 
‘nates may appear to us, from another viewpoint, to be moving. 
The uniform motion of a person walking along the platform of a 
station will appear nonuniform if described in a system of coor- 
dinates based on a braked train. Therefore, when speaking about 
the law of motion, the frame of reference for which this law holds 
must be indicated. The system for which Newton’s laws are valid 
must, without fail, satisfy the following conditions: a body on 
which no forces are acting must move rectilinearly and uniformly 
or must be at rest. Such a system is called an inertial system. 

Thus, it is evident that all frames of reference that are execut- 
ing accelerated motion with respect to a body on which no forces 
are, acting are not inertial systems. Another important conclusion 
that immediately follows is that there is not merely one inertml 
system. In fact, an infinite number of inertial systems exist. An 
inertial system can be based on any body moving uniformly and 
rectilinearly with respect to some particular body on which no 
forces are acting. © 
. Let us assume that an inertial system has been selected. New- 
ton’s law, F=ma,. is valid for any body moving in this system 
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with a velocity uv and acceleration a. Now, let us consider another 
frame of reference moving rectilinearly and uniformly with a ve- 
locity w with respect to the inertial system. To be sure, in this 
system, the same body will have a different velocity, equal to the 
difference between the velocity v and the velocity uw, the motion 
of the second system with respect to the first. However, since the 
relative motion of these two systems is rectilinear and uniform, 
the acceleration of the body will be the same in both systems. 
Expressed mathematically, since acceleration is the derivative of 
velocity and the derivative of a constant quantity is equal to zero 

ua 

7 =0 |: 

ae dv d(wv—n) 

dt dt 





The acceleration of a body enters into Newton’s law, but the 
velocity does not. As a result, the fundamental law of mechanics 
is exactly the same in both systems. 

This important proposition, following from Newton’s law o 
mechanics, is called the principle of the relativity of motion. It can 
be summarised as follows: An infinite number of inertial systems 
exist, and, in such systems, the law of inertia and the F=ma 
law are satisfied. In this respect, none of these systems has any 
special advantage over the other systems. All inertial systems are 
equally suitable for the description of physical phenomena. 

The principle of relativity was first formulated by Galileo. 

Laws of Mechanics in a Noninertial System of Coordinates. Let us 
assume that the statement “acceleration is due to forces” is always 
valid in every system of coordinates. In noninertial systems of 
coordinates, a body executes accelerated motion even when it is not 
interacting with other bodies. But if this is so, noninertial systems 
possess, in addition to forces due to interaction, forces of different 
origin, i.e., forces resulting fromi the noninertial character of the 
system. These additional forces are called inertial forces (although 
jt would actually be more correct to call them noninertial forces). 
Since inertial forces do not result from interaction, they do not 
satisfy Newton’s third law of motion. 

We shall limit ourselves to a simple example of inertial force, 
for in this book we do not intend to employ noninertial systems 
of coordinates in the analysis of motion. 

Let us assume that, for certain reasons, it is convenient to select | 
a system of coordinates moving with an acceleration a, having a 
constant magnitude and direction. All bodies at rest or moving 
uniformly with respect to inertial systems will move with an ac- 
celeration —a@ in relation to the noninertial system selected. The 
acceleration —a is produced by a force —ma. 
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This is the inertial force for the case under consideration. It is 
not the result of the interaction of bodies, but is due to the 
accelerated motion of the reference system. 

If the body under consideration in the noninertial reference 
system interacts with other bodies, the inertial force is added to 
the forces due to interaction. 

The fundamental law of mechanics in noninertial systems of 
coordinates is written in the form: 


ma =F -+-inertial forces, 


where F is the resultant force due to the interaction of the bodies. 

The expression for the inertial forces will vary in accordance 
with the nature of the motion of the noninertial reference system 
(rectilinear, circular, circular with accelerated speed, etc.). Formu- 
las for inertial forces in a variety of cases can be found in books 
on theoretical physics. 


4. Application of the Fundamental Law 
of Mechanics to Accelerated Rectilinear Motion 


In.this section, we give several elementary examples illustrating 
the physical meaning of the fundamental law of mechanics, which 
states that the vector sum of the forces acting on a body is equal 
to the product of the mass of the body and the acceleration, and 
its direction is that of the acceleration. 

Horizontal Motion Under the Action of a Constant Force. An 
engine moves a trolley located on rails. Two forces act on the 
frolley in opposite directions —F,,, the frictional force exerted by 
the rails, and F,,, the force exerted by the engine. If these two 
forces are equal, the trolley moves uniformly. In order for the 
trolley to accelerate, the resultafft force must be directed parallel 
to a. Therefore, to produce accelerated motion, the motive force 
must be greater than the frictional force. Moreover, the difference 
between these forces is the resultant force, which according to the 
fundamental law of mechanics is equal to the product of the mass 
and the acceleration. Thus, 


F,, —F,,= ma. 


® 

- The frictional force is the result of the interaction of the rails 
with the trolley. Therefore, coupled with F,, is the force ‘exerted 
on the rails (F,,). Similarly, coupled with F,, is F,,, the force which 
the trolley exerts on the engine. 

The force F;, is the resistance force overcome by the engine 
(experienced by and acting on the latter). This is the force that 
would act on a man’s muscles if he were the source of motive 
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power. As can be seen, the resistance force F,, consists of two terms: 
the frictional force and the quantity— ma, which can be called the 
inertial resistance. Inertial resistance is always related to the effec- 
tive force acting on the accelerated body. It is equal to ma in 
magnitude but is oppositely directed to it. The inertial resistance 
could also be a single force acting on the accelerated body, which 
would be the case 
here if there were no 
friction. 

Let us consider an- 
other example of hori- 
zontal motion under 
the action “of a con- 
stant force. The load 
under consideration is 

Fig. 7 placed on a moving 

trolley having a back 

stop (Fig. 7). If there were no back stop, the load could slip off 

the trolley when the motion is accelerated. With no back stop, 

the fate of the load depends on the interaction between the trol- 

ley’s floor and the load. This interaction involves only friction. The 

trolley moves with a small acceleration @ and the force acting on 

the load, i.e., the frictional force, should be equal to ma.* But 

the static frictional force cannot have any magnitude whatsoever. It 
must be somewhat less than Fy,-*. If 


Max 


ma> Fy, ? 





motion-with acceleration @ becomes impossible and the load slides 
off the trolley. If there were no friction between the load and the 
floor of the trolley, the load woulg not move from its place, i.e., 
the trolley would move out from under the load. Let us now assume 
that the trolley has a back stop. The load is then prevented from 
sliding as soon as it comes into contact with the back stop. 
The back stop now pulls the load with the force F=ma. The force 
coupled with the motive force is the inertial resistance experienced 
by the back stop. It is also equal to ma, but is directed oppositely 
to the acceleration and acts on the back stop. 


Examples of force. Tfie force accelerating a passenger car is ~200 kg= 1,960 
newtons (1 newton is the force imparting an acceleration of 1 metre/sec? to a 
mass of 1 kg; 1 newton=10° dynes=0,.102 kg). The thrust developed by the 
jet engine of a modern aircraft is 10,000-20,000 kg=10*-2x 10° newtons and 
the tractive force developed by a T2-3 diesel locomotive is ~ 10,000 kg. 


* In accelerated motion, if some body is carried along merely because of 
friction (the body carried along is at rest with respect to the carrier), the static 
frictional force will always have the direction of the acceleration, 


Sec. 4] Accelerated Rectilinear Motion ai 


Vertical Motion of an Elevator. Let us consider the forces acting 
on a load located on the floor of an elevator executing nonuniform 
motion. 

Assume the elevator is accelerated upward (Fig. 8). Two forces 
act on the load: the Earth’s force, F,, and the force exerted by 
the elevator’s floor, F,, But now the resultant force must be differ- 
ent from zero, so F,,=4F,, Since the resultant force will be in 
the direction of the acceleration, F,,> 
>Fyz, and 

Pay Pep ma. 


The force F,,, is simply the gravitational 
force exerted by the Earth on the load. 
Thus, 

F,, —mg = ma. 


The magnitude of the force exerted by 
the load on the elevator, F,,, is exactly 
the same as F,,; thus, the resistance ex- 
perienced by the elevator in lifting the 
load is 





Centre of gravity 


F,,= mg +ma. Fey=g 


We see that this resistance consists of FoeFed 


the weight of the load and the inertial 
resistance. The force F,, is sometimes 
called the apparent weight. Fig. 8 

The result obtained is for the case 
when the acceleration of the elevator is directed oppositely to that 
of gravity. This condition is satisfied not only when the elevator 
is being accelerated upward, but also when it is decelerated during 
its downward motion. 

When the direction of the gravitational force and the acceleration 
of the elevator coincide, the force exerted by the load on the ele- 
vator (apparent weight) is 


F,,= mg — ma. 


From this formula, it is evident that the pressure against the floor 
of the elevator ceases when a=g, i.e., when the elevator fails 
freely in the gravitational field. In this case, the body ir® the 
falling elevator ceases to press against the floor and stretch the 
cable, i.e, the body, so to speak, ceases to have weight. 
Force on a Freely Hanging Load. Let us consider the motion of 
a plumb bob suspended from a trolley executing accelerated motion. 
For such motion, the string by which the plumb bob is suspended 
forms an angle with the vertical. Two forces act on the load: Fy, 
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the tension on the string, and F;,, the Earth’s attraction, which 
is equal to mg (Fig. 9). These forces are directed at an angle to 
each other. According to the fundamental law of mechanics, their 
vector sum is equal to ma and 
its direction is that of the ac- 
celeration. The diagonal of the 
parallelogram formed by the for- 
ces F,, and F, is, therefore, hori- 
zontal: 


ma=F,,+F x, 


The force coupled with Fz, is 
exerted on the Earth and does 
not interest us. We are, how- 
ever, interested in the force F,,, 
i.e., the force with which’ the 
load stretches the string. The 
value of this force, exerted on 
the string, is: 


F,,=— ma +F ry 


Thus, in this example too, the inertial resistance is a component 
part of the total resistance experienced by the accelerated body. 





5. Application of the Fundamental Law 
of Mechanics to Circular Motion 


Motion in a circle is accelerated motion. If a body moves in a 
circle with constant angular velocity, the magnitude of its acceler- 
ation is equal to w’R and its direction is inward along the radius. 

While executing uniform motion in a circle, the body may be 
under the action of any number of arbitrarily directed forces. 
However, in accordance with the fundamental law of mechanics, 
the vector sum of these forces, or, simply, the resultant force, must 
be directed inwardly along the radius (parallel to the acceleration) 
and the value of its magnitude must be 


mv? 
F centrip saci * Sea mo’R. 


The resultant force acting on the uniformly rotating body is called 
the centripetal force. We again emphasise that the resultant force 
always has the direction of the acceleration and not of the velocity, 
i.e., in our case, the force producing uniform circular motion is 
directed along the radius toward the centre of the circle and not 
along the tangent to the circular path. The role of the centripetal 


Sec. 5] ° Circular Motion 33 


force is to continually deflect the body from the rectilinear path 
along which it would move, as the result of inertia, if this force 
were not present. 

Example. A particle of mass m caught on a blade of a modern steam turbine 


(3,000 rpm, radius about 1 metre) experiences a centripetal force F = mw?r=m x 
X (314)? x 100= mx 10’ dynes (where m is in grams). The weight of the particle is 


7 : 
equal to mg. Thus, the centripetal force is ae or about 10,000 times, the 





weight of the particle. 


If a body is given accelerated motion, then, in conformity with 
the law of action and reaction, the accelerated body should act on 
other bodies (constraints) that make it accelerate rather than move 
in accordance with the law of inertia. The force of the accelerated 
body on the constraint has been called the inertial resistance. Such 
a force also exists, of course, for circular motion—it is called the 
centrifugal force. 

Centrifugal force and centripetal force are equal in magnitude 
but are oppositely directed. The centrifugal force is applied to the 
constraints of a body executing circular motion or, in other words, 
is applied to those bodies making the body under consideration 
move in a circle, and preventing it from moving rectilinearly and 
uniformly. As in the case of centripetal force, centrifugal force is 
a resultant—the sum of all the reactions exerted by a rotating 
body on its constraints. 

Let us consider several examples, limiting ourselves to the simple 
case of circular motion due to the interaction of two bodies. If 
body A prevents body B from moving rectilinearly and uniformly, 
and makes it move uniformly in a circle, 7 

y y 
(Pes us 






F,, is the centripetal force and Fp, is 


the centrifugal force. Such simpleinteraction Eye Wh 
occurs between a body located on a_bow!l- GQ : 
shaped pedestal, rotating about its axis in YU 





the horizontal plane, and the pedestal itself pred lh 
(Fig. 10). If the frictional force is not @5== 
very large and the pedestal is rotating rap- ie 
idly, the body slides to the wall of the 
bowl. In this case, the interaction between Fig. 10 
the body and the pedestal consists in the 
following: the wall of the bowl acts on the body inwardly aleng 
the radius (centripetal force), and the body, with a force of equal 
magnitude, presses against the wall outwardly along the radius 
(centrifugal force). 

We return now to the initial moment in this experiment. The 
body is lying on the pedestal and the pedestal has just begun to 
rotate. If there were no interaction between the body and the 
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pedestal, the body would remain in place and the pedestal would 
rotate under the body. The presence of static friction prevents 
this from happening. The body rotates together with the pedestai. 
Moreover, as was indicated in the previous section, the static fric- 
tional force will be directed inwardly along the radius. The static 
frictional force is the only force impelling the body to rotate, ie., 
the frictional force in this case is a centripetal force. Therefore, 


F jr F centrip- 


The centrifugal force is exerted by the body on the pedestal and 
is thus directed outwardly along the radius. If for the purpose of 
Clarity (it should be remembered, however, that this is a very gross 
picture) friction is conceived as being due to the engagement of 
two rough surfaces, whereby the surface protuberances of the first 
body mesh with those of the second, then the centrifugal force 
constitutes a force acting outwardly along the radius at the meshed 
points of the pedestal surface. 

The frictional interaction maintaining the body fixed with re- 
spect to the pedestal must be less than a certain maximum F?¢-, 
In increasing the velocity of rotation of the bowl, the value of 
mo’R finally becomes greater than F7%, which makes it impossible 
for the body to execute circular motion with acceleration |a|=o’R. 
Indeed, to secure circular motion with angular velocity , a force 
mw*R must act on the body. If the frictional interaction cannot 
provide this force and, hence, motion in a circle of radius R with 
angular velocity , the body moves with respect to the pedestal 
and static frictional interaction ceases to exist between the body 
and the pedestal. 

- As soon as interaction between the body and the pedestal ceases 
and the body becomes free, rectilinear and uniform motion begins, 
the velocity being that possessed by the body at the moment of 
dissociation. Since the velocity of a body moving in a circle is 
directed along the tangent, this line represents the line of motion 
of the freely moving body. 
The tangential path of par- 
ticles dissociated from a 
rotating body is very clear- 
ly demonstrated in the case 
of particles flying from a 
rotating grindstone. 

Let us now consider the 
rotation of a stone tied to 
the end of a string (Fig. 
11). In order to uniformly 
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rotate the stone at the end of the string under normal conditions, 
tangential acceleration as well as centripetal acceleration must be 
imparted to the body. The tangential acceleration is necessary 
in order to overcome the friction with the air. The resultant ac- 
celeration—and, hence, the force—is not directed along the radi- 
us, but forms an acute angle with the direction of motion. The 
hand executes rotational motion and the string is directed at each 
instant along the tangent 
to the circle described by 
the hand. 

As another example of 
circular motion, let us con- | 
sider the rotation of two 
attracting bodies having the 
same angular velocity about 
acommon centre. By means 
of a centrifugal machine, 
it is not difficult to make 
two bodies of equal mass, 
joined by a string, revolve 
about a common axis. 

To begin with, let us Fig. 12 
consider the first body, 
with a string attached to the rotating shaft. The centrifugal force 
acting on the shaft is equal to m,*R,. Similarly, the second body 
acts on the shaft with a force m,w’R,. If these forces are equal, 
the strings could be joined to each other as shown in Fig. 12, 
for nothing would change thereby. It is thus clear that the condi- 
tion for stable rotational motion of two bodies joined by means of 
a string is the equality of the centrifugal forces exerted on the 
string by these bodies: 





Thus, ee 
my, Re 
mR,’ 


ie., stable rotation takes place only when the ratio of the distances 
to the axis of rotation is inversely proportional to the masses ‘of 
the bodies. 

Mm, 


The point dividing the distance R,+R, in the ratio e=2 


(Fig. 12) is called the centre of mass (see Sec. 15). It can be stat- 
ed that stable rotation of two joined bodies takes place about the 
centre of mass of the system. 
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We have spoken about two bodies whose interaction is achieved 
by means of a string. However, the above is also completely valid 
when two bodies attract each other in accordance with the law of 
universal gravitation, or when a positive and a negative charge 
attract each other. Thus, interaction of any kind between two 
attracting bodies can produce stable rotation about the centre of 
mass of the system. This interaction is given by two forces applied 
to the attracting bodies. The forces are oppositely directed but nu- 
merically equal. (At this point, the unsophisticated reader will 
usually ask: Why do the bodies not attract each other? We repeat: 
The forces are parallel to the accelerations, not to the velocities, 
and in circular motion the acceleration is directed along the radius 
toward the centre of rotation.) Since a single force acts on each 
body, both are centripetal forces. At the same time, both are also 
centrifugal forces. Thus, body A acts as a constraint for body B, 
and vice versa. In other words, for body A, Fp, is a centripetal 
force while F,, is a centrifugal force, and vice versa for body B. 
However, the concept of centrifugal force is used here in a com- 
pletely formal sense. It was introduced only in order to emphasise 
the similarity existing between a ‘system of spheres joined by a 
string and a system of bodies.“joined” by a force of attraction. 

A planetary system is an example of stable rotation of attracting 
bodies. Let us assume that the Sun had only one planet—the 
Earth. The centre of rotation would then divide the line joining 
the Sun and the Earth in the ratio msun:mearth = 330,000: 1. 

Thus, when it is ordinarily said that the Earth rotates about the 
Sun, we are not committing a serious error, and this would be so 
even if the Earth were the Sun’s only planet. 


G. The Effect of the Earth’s Rotation on Mechanical Phenomena 


The motion of the Earth is complex. It revolves about its axis 
and, at the same time, moves in an orbit about the Sun. Hence, 
it is clear that the Earth does not constitute an inertial frame of 
reference. Nevertheless, under conditions prevailing on the Earth, 
Newton’s law is generally quite satisfactory. In a number of cases, 
however, the noninertial property of the Earth’s frame of refer- 
ence has an appreciable effect on the phenomena being studied. 
These cases should be investigated. 

Effect of the Earth’s Rotation on Its Form. Weight of a Body. 
If the Earth’s rotation is not taken into consideration, a body 
lying on the surface of the Earth can be considered at rest. The sum 
of the forces acting on this body would then be equal to zero. 
As a matter of fact, any particle on the Earth’s surface lying at 
latitude @ moves with an angular velocity w= 0.7292 x 107* sec™* 
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about the globe’s axis, i.e., in a circle of radius r= Rcos@ (R is 
the radius of the Earth, which is assumed to have, to a first 
approximation, the shape of a sphere). Therefore, the sum of the 
forces acting on such a particle differs from zero. It is equal to the 
product of the mass and the acceleration, wR cos q, and is directed 
along r. 

it is clear that the presence of such a resultant force OG 
(Fig. 13) is possible only when the reaction of the Earth’s surface 
OA and the gravitational force 
OE are directed at an angle 
to each other. The body will 
then press on the Earth’s sur- 
face (according to Newton’s 
third law) with a force OC = 
=:— OA. If the globe were at 
rest, this force would be equal 
io the gravitational force OF 
and would also coincide with 
its direction. 

Let us resolve the force 
OC into two components — 
one force directed along the 
radius OD and the other along 
the tangent OB. As can be seen 
from the figure, the Earth’s 
rotation results in two effects. 
First, the weight (the body’s 
pressure on the Earth) becomes 
less than the gravitational 
force. Since OC = OD, this decrease equals DE = mR’ cos’ q. 
Secondly, a force is produced tending to flatten the. Earth, i.e., 
shift matter toward the equator. This force OB = mRw’* cos @ sing. 
Such flattening has actually taken place, for the Earth’s shape is 
not spherical but close to an ellipsoid of revolution. As a result of 
this effect, the equatorial radius of the Earth is 1/300 greater than 
the polar radius. 

The flattening force tended to redistribute the mass of the globe 
as long as the latter’s form was not in a state of equilibrium. 
When this process was completed, the flattening force eviderttly 
ceased to be effective. Hence, the force exerted on the terrestrial 
“globe” is directed normal to the surface. 

Let us now return to the quantity expressing the pressure of the 
body on the Earth, i.e., to the physical quantity generally called 
weight. The calculation made for a sphere (gravitational force mi- 
nus mRw’ cos’ @) is naturally not valid for the actual shape of the 
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Earth. However, for approximate calculations, this value can be 
used. 

At the poles (p=90°), the weight of the body is equal to the 
gravitational force. Let us denote the gravitational force of the 
body at the poles by mg. As indicated above, the pressure exerted 
by the body on the Earth’s surface at any point of the globe, in 
other words, the weight of the body, will be equal to the difference 
between the gravitational force and the force DE, i.e., 

mg —mRo’ cos’ p= mg". 
Thus, 
g’ = g— Ro’ cos’ @ 

is the acceleration with which a body falls at the latitude @. At the 
equator, g’ is 1/300 less than g. 

_ If we use an appropriate value for the acceleration of a freely 
falling body at each of the various latitudes, we do not have to 
calculate the effect of the Earth’s rotation on the weight of the body. 

Effect of the Earth’s Rotation on the Motion of a Body on the 
Earth’s Surface. Let us assume that the motion of a body is ob- 
served in a rotating system of coordinates. The body moves rectili- 
nearly and uniformly past the observer, and the motion is curv- 
ilinear in the selected noninertial frame of reference. Coriolis, 
the French scientist, showed by means of calculations that relative 
to a system rotating with angular velocity @ a body moving recti- 
linearly and uniformly with velocity v has an acceleration equal 
to 2umsina, where a is the angle between the axis of rotation and 
the direction of the rectilinear 
motion. The acceleration is 
directed perpendicular to the 
plane passing through the axis 
of rotation and the direction 
of the velocity. We may use 
the following rule ‘to deter- 
mine which of the two pos- 
sible directions is that of the 
acceleration. If one looks along 
the axis of rotation in the 
direction that makes the rota- 
tion appear counterclockwise 
and places his left hand palm 
down with the fingers pointing 
in the direction of the rectili- 
near motion, the thumb will 
point in the direction of the 
Fig. 14 acceleration (Fig. 14). 
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The Coriolis acceleration aco, acts on all bodies moving on the 
Earth’s surface. If one looks along the Earth’s axis from the North 
Pole, the rotation appears counterclockwise. Hence, in the Northern 
Hemisphere, any body moving rectilinearly relative to an inertial 
system will deviate to the right (as viewed by a terrestrial observ- 
er) in the course of its motion, while in the Southern Hemisphere, 
it will deviate to the left. This deviation could be large or small, 
depending on the direction of motion with respect to the axis and 
on the linear velocity of the motion. 

The deviation of the body can take place in the horizontal or in 
the vertical plane (with respect to the surface of the Earth). The 
Coriolis acceleration is directed perpendicular to the Earth’s axis; 
hence, the deviation taking place in the horizontal plane is greatest 
at the poles and equal to zero at the equator. The reverse is true 
for deviations in the vertical plane. The deviations in these two 
planes determine the corresponding projections of the acceleration 
vector. Thus, the projection of the acceleration of the body in the 
horizontal plane is 

2uo sin @, 


where g is the latitude. In the Northern Hemisphere, this projec- 
tion is directed to the right of the motion. 

The deviation of bodies moving in the horizontal plane from 
their rectilinear path is the reason why the right banks of rivers 
are eroded in the Northern Hemisphere, and the left banks in the 
Southern Hemisphere. For the same reason, rivers in the Northern 
Hemisphere by-pass obstacles to the right and in the Southern 
Hemisphere to the left. 

Air masses flowing into regions of low pressure deviate from the 
radial direction to the right in the Northern Hemisphere (to the 
left, in the Southern Hemisphere) and form cyclones. Thus, cyclones 
in the Northern Hemisphere stir the air masses counterclockwise, 
and in the Southern Hemisphere clockwise. 

As a result of the vertical deviation, a falling body does not 
fall exactly vertically. Such a body deviates from east to west 
(the Earth rotates from west to east, i.e., counterclockwise if 
viewed from the North Pole). 

Examples. 1, Let us calculate the maximum deviation of a normal artillery 
shell. The deviation will be a maximum at the poles (p=90° and for all firtng 
directions a= 90°). If we take the velocity of the shell to be 1 km/sec, we 
obtain a deviation of 2x1,000x0.73x1074+0.15 mjsec*. The gravitational 
acceleration is about 70 times greater than this acceleration. As can be seen, 
the deviation of the shell from its rectilinear path can attain a magnitude of 
the order of several centimetres per second. : 

2. Assume that a river is flowing from north to south (in the Northern 
Hemisphere) with a velocity v=3 km/hr. Thus, the water moves from a region 
of small linear velocity of rotation of the Earth’s surface to a region of larger 
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linear velocity. This increase in the velocity of motion (directed from west to 
east, together with the banks of the river) is determined by the Coriolis acceler- 
ation and is due to the action of the right bank of the river on the mass of 
water. Let us calculate the Coriolis acceleration for the latitude p= 45°: 
ACor = 2UW, Sin P, 
@) == 2n rads/day=7.95x 1075 rad/sec, u=3 km/hr=0.83 m/sec, 
ACop = 2X 83 X7.25 X 1075 X 0.707 = 850 X 1075 = 8.5 1073 cm/sec’. 


Thus, on every ton of water the right bank exerts a force of 


8.5 X 1079 Jos 


380 gm=8.7 gm. 


The steep right banks of the Volga, Don and other big rivers of the Northern 
Hemisphere illustrate this effect. 


7. Data Necessary for the Solution of Problems in Mechanics 


The basic problem of mechanics is the determination of the motion 
for given forces. To determine the motion means to be able to 
indicate the location in space and the corresponding instant of 
time for any of the particles. If we are concerned with a complex 
mechanical system, then such data are necessary for each of the 
particles into’ which this system can be considered divided. 

In order to tackle such a problem, we must, in the first place, 
have complete data on the effective forces. The forces must be known 
for every particle and for every location of this particle. If these 
forces are known, then by means of Newton’s equations we can 
determine the acceleration of the particle. However, Newton’s equa- 
tions of motion alone are insufficient to completely determine the 
path, the velocity and the instant of time corresponding to the 
passage through a given point in space. To describe the motion, 
it is necessary to know for each instant of time the location of 
the particle and the magnitude and direction of the velocity. In 
all, six quantities must be given: three coordinates and the three 
projections of the velocity on the axes. These data uniquely describe 
the “mechanical state” of a particle and may be called the para- 
meters of state. . 

Thus, the problem reduces to the determination of the parame- 
ters of state, for Newton’s equations only give the acceleration. 

To solve the problem, the initial conditions must be known, i.e., 
the values of the parameters of state for some instant of time (this 
instant is usually designated by ¢=0, whence the designation “ini- 
tial conditions”). If the initial values of the parameters of state 
are known, the rest is merely a matter of mathematics. Newton’s 
equations of motion plus the initial data suffice to uniquely solve 
the mechanical problem. In principle, the future motion of the par- 
ticle as well as the past motion can be established for any desired 
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period of time. This concept amazed scientists at one time. La- 
place, the great French scientist and thinker, once said: If we knew 
the initial coordinates and velocities of all the particles comprising 
the world, we would be able to predict the fate of the world. This 
somewhat naive viewpoint, reducing all reality to purely mechanical 
phenomena, is not valid in principle, and not merely because it is 
practically impossible to obtain the required data. Mechanics, based 
on Newton’s laws, has limited application and its conclusions cannot 
be applied that broadly. 

Let us return, however, to the six initial conditions. The need 
for giving precisely six quantities for a particle is evident from 
Newton’s equations themselves. 

The vector equation can be resolved into its components and 
written in the form of three equations: ma,=F,, ma, =F, and 
ma,=F,. To determine the motion, it is necessary to establish how 
the particle’s three coordinates x, y, z vary with time. In order to 
establish the dependence of the coordinate x on time, we must 
integrate the equation 

duyz 
mF = Fy. 

The first integration enables us to find the x-component of the 
velocity. Upon integrating, we obtain the first constant of integra- 
tion. The second integration enables us to find the coordinate x as 
a function of time, and the second arbitrary constant is obtained. 
The above also holds true for the equations of change with respect 
to time for the other two coordinates. In all, six arbitrary con- 
stants are obtained. These may be determined only if six independ- 
ent facts about the coordinates and velocities of the particle 
are known. 

As we have indicated, the initial conditions consist of the three 
initial coordinates and the three projections of the initial velocity. 
However, the problem could also be solved if six other quantities 
are known. For example, we may be given the three coordinates 
of the initial point, the numerical value of the initial velocity, 
and two coordinates of the final point. The path of the particle 
is also uniquely determined by these six conditions. 

The parameters of the particle may be given in a variety of 
ways. The location of the particle in space may be given by three 
Cartesian coordinates or by the distance from the origin of the 
coordinate system and two angles formed by the radius vector with 
_the axes. Similarly for the velocity. 

A typical example of the dependence of a body’s motion on the 
initial conditions is the behaviour of a rocket fired from the sur- 
face of the Earth. The trajectory of the rocket and its destiny is 
determined by -the firing direction, the geographical location of the 
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launching site and the magnitude of the initial velocity. As is well 
known, for small firing velocities from the Earth, a body has 
a parabolic trajectory. For a velocity of about 8 km/sec, equilibrium 
is achieved between -the centrifugal force and the gravitational 
force, and the launched body may be placed in a circular orbit. 
For velocities between 8 and 11.2 km/sec, the launched body moves 
in an elliptical orbit about the Earth. At an initial velocity of 
about 11.2 kmjsee, the kinetic energy of the body becomes suffi- 
cient to completely overcome the Earth’s gravitational attraction. 
A rocket launched with such a velocity will have a hyperbolic 
trajectory. 

If the mechanical system consists of nm independent points, the 
number of parameters for the system will be equal to 6n. 

In some cases, however, constraints which serve to decrease this 
number may be placed on the mechanical system. A simple example 
is a centrifugal regulator, which may be considered as a system 
consisting of two joined spheres that can slide apart and turn about 
a common axis. It is clear that, given the distance of a point from 
the axis of rotation and the azimuthal angle with respect to an 
arbitrary line, we can uniquely determine the mechanical state of 
the system. Two “coordinates” and two velocities of change of these 
coordinates constitute the parameters of this state. 

Let us now consider an arbitrarily rotating solid body and deter- 
mine the data required to fix its, position with respect to a station- 
ary system of coordinates. It is clear that the centre of mass of 
the body is determined by three quantities. To describe the body’s 
rotation, three angles suffice. We need not elaborate on this point, 
for it is evident that by means of three rotations about mutually 
perpendicular axes any desired orientation of a body can be achieved. 

Thus, the solid body requires twelve parameters—six coordinates 
and six velocities of change of these coordinates. 

As another example, let us consider two rigidly joined points. 
If they were free, six coordinates would be required to describe 
them. Since they are rigidly joined, an additional condition relating 
the coordinates of these points exists, namely: 


(x, —x,)? US) +(2, —z,)? = const. 


Thus, five independent quantities are required to describe this system. 
In all, there are ten parameters—five coordinates and five veloci- 
ties of change of these coordinates. 
Since the parameters of state are always equally divided between 
“coordinates” and velocities of change of the “coordinates”, it is 
customary to speak of the degrees of freedom of a system, whereby 
we mean the number of independent coordinates required to describe 
the system. Thus, one point has three degrees of freedom, two 
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rigidly joined points have five degrees of freedom, a solid body— 
six degrees of freedom, a system consisting of n independent points— 
3a degrees of freedom, etc. The meaning of the following proposition 
should now be clear: The mechanical state of a system is described 
by giving its parameters in terms of the number of degrees of freedom. 


8. Constants of Proportionality and Dimensions 
of Physical Quantities 


The coefficient y in the expression for the law of universal gravi- 
tation is a universal constant depending on the choice of units for 
force, mass and distance. It is possible to choose the units in such 
a manner that y=1. This would require that the unit of mass be 
equal to the mass of a particle attracting a similar mass at unit 
distance with unit force. In the CGS system of units, such a mass 
would be equal to 1.510" gm, i.e., 15 tons. 

Thus, universal constants in formulas of physics depend on the 
specific choice of units. If we desired, we could eliminate all such 
a from formulas of physics by appropriately choosing the 
units. 

It is important to grasp the concept that the employed system 
of units and the constants of proportionality in formulas are inter- 
connected. We can demonstrate this interconnection by dimensional 
formulas. First, the number of units that we wish to consider funda- 
mental must be established. This number depends entirely on us 
and is determined exclusively by considerations of convenience. 

A widely used system of units in physics is based on the units 
of length (L), mass (M) and time (7) as the independent quantities. 
The values of all universal constants and the units of measurement 
of all other quantities are then uniquely determined by the choice 
of units for L, M and 7. The nature of this relationship is given 
by so-called dimensional formulas. Several examples will make their 
meaning clear. The dimensions of velocity are LT~', acceleration— 
LT~-?, force—MLT~?, the gravitational constant —M~-'*L*T~-*, elec- 
tric charge in the formula for Coulomb’s law—M‘:L%:T~-', etc. 
Knowing these formulas, we can immediately say how the numerical 
values of the universal constants and the units of derived physical 
quantities vary when the magnitude of some fundamental quantity 
is changed. 

As we shall see by examples in Sec. 81, dimensional analysis of 
physical quantities can be used to predict the nature of some depend- 
ence or other between physical quantities. 

In-addition to the system based on distance, time and mass, 
a:system in which the fundamental quantities are distance (L), 
time (T) and force. (F) is widely used. This is known as the FLT 
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system. Naturally, the dimensional formulas in this system will not 
always be the same as above. Thus, moment of force in the FLT 
system has the dimensions FL, and in the MLT system the dimen- 
sions ML’?T~*. Mass, being a derived quantity in the FLT system, 
has the dimensions FL~'T?. 

The fundamental law of mechanics relates the quantities of force, 
‘mass, distance and time. Therefore, the value of the constant of 
proportionality in this formula depends in both systems on the 
choice of units. In both systems, a constant of proportionality 
equal to unity is assumed. This means that in the MLT system, 
using the formula F=ma, the unit of force is chosen so that F=1 
when the mass and the acceleration are equal to unity. In the FLT 
system, using the formula mat, the unit of mass is chosen so 
that m-==1 when the force and the acceleration are equal to unity. 

In this book, we shall for the most part use two variants of 
the MLT system: 

CGS system: L—-centimetre, M—gram, T—second; 

MKS system: L—metre, M—kilogram, T—second. 

In the CGS ‘system, the unit of force is the dyne= 1 gm x cm/sec? 
and the unit of work is the erg=dynexcm. In the MKS system, 
the unit of force is the newton=1kgxm/sec* and the unit of 
work is the joule = newton x metre. 

If the reader is confronted with data expressed in the FLT sys- 
tem, these data should be converted into one of the indicated 
systems. To do this, he only need recall that a unit of force in 
the FLT system is a kilogram (the weight of a kilogram mass at 
sea level at 45° latitude), which is related to the two units of 
force adopted by us as follows: 


1 kg=9.81 newtons= 9.81 x 10° dynes. 


We shall return to the subject of systems of units when we con- 
sider electrical quantities. 


CHAPTER Ill 


MECHANICAL ENERGY 


9. Work 


Motion without acceleration (i.e., rectilinear and uniform) may 
take place either without the action or with the action of forces on 
the body. In the latter case, the sum of the forces acting on the 
body is equal to zero. There is an essential difference between these 
two kinds of motion. In the first case, motion is not accompanied 
by work, while to achieve the second type of motion work must be 
expended. A motor works, moving an automobile uniformly and 
rectilinearly. A man works, moving a sleigh with its load uniformly 
and rectilinearly. We say in these cases that work is expended in 
overcoming resistance—friction, air resistance, etc. 

Of the two balanced forcés acting on a body moving without 
acceleration, one is directed along and the other opposite to the 
direction of motion. 

We say that a force acting along the direction of motion per- 
forms work. On the other hand, as regards a force directed opposite 
to the motion, we say that work is performed against this force. 

For quantitative evaluation, work is expressed as the product of 
the force acting on a body and the distance traversed by the body. 
The term work is the designation ae 

: : : 7 
for this physical quantity. 

Let a body be acted upon by 
a number of forces whose vector Ff F 
sum is equal to zero. The body 
moves uniformly and rectilinear- 
ly. All the forces may then be 


resolved into four components Fe 

(Fig. 15). Forces F, and F, in Direction of mation 
accordance with the adopted defi- a 
nition, perform no work. Force Fig. 15 


F performs work equal to FAS, 
-where AS is the traversed distance. The work of force F’ is equal 
to—FAS, where the minus sign indicates that work is performed 
against the force F’. a 

Let us now consider the motion of a body with acceleration, 
ie., curvilinear and nonuniform motion. As we already know, a 
resultant force acts on the body, in this case, that is directed along 
the acceleration (but not along the path in the general case!). Let 
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us again resolve all the effective forces into forces directed along 
the motion and perpendicular to the motion (Fig. 16). Now, F is 
not equal to F’, and F, is not equal to F,. Using the definition for 
work given above, we can assert, as before, that F, and F, perform 
no work. The work of force F’ is again negative, i.e., the work is 
performed against the force F’ and is 
equal to F’AS. Force F performs the 
work FAS, which is more than the 
* work against the force of resistance. 
The surplus work serves to accelerate 
the body. 

The inequality between the forces F, 
Fa. and F, shows that the motion is curvi- 
linear. Their difference F,—F, cor- 
responds to the normal component of 
the acceleration. 

Fig. 16 Let us consider an extreme case— 

uniform motion in a circle. The result- 

ant force for such motion is directed, as we know, along the ra- 

dius of the circle, i.e., perpendicular to the direction of motion. 
Therefore, the centripetal force performs no work. 

Thus, the surplus work in the general case of curvilinear accelerat- 
ed motion is not used to produce acceleration in general, but only 
the tangential component of the acceleration. For a particle, this 
can be expressed as follows: 


F—F’=ma, and FAS—F'AS=ma,AS. 


Fy 





a ace el 
Direction of motion 


We repeat: (F—F’) is the tangential component of the resultant 
force F;°’. 

The work expended in accelerating a body (being equal, by defini- 
tion, to the projection of the resultant force on the direction of 
motion multiplied by the traversed distance) is equal to the product 
of the mass of the body, the traversed distance, and the tangen- 
tial acceleration. The last equation above can be written in the 
form FAS=F’AS-++ma,AS and can be read as follows: The work 
performed by the effective force consists of the work against the 
force of resistance and the work expended in accelerating the body. 


Examples. 1. A jet passenger plane, having a weight P=70 tons, attains a 
height h=10 km. If it moves uniformly, the work performed in rising to this 
height is 


A,=Ph=7 X 108 kg-m= 68.6 X 10° joules= 68.6 X 10" ergs. 


If this height is attained over a path S=85 km with a simultaneous increase 
in velocity (acceleration a=0,3 m/sec?), the additional expenditure of work in 
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producing acceleration will be 
A,= ma X S=17.9 X 108 joules=17.9 & 10'® ergs = 1.82 108 kg-m. 


2. To plane a board 2 metres long and 20 centimetres wide, a joiner expends 
about 150 kg-m of work. 


10. Kinetic Energy 
Thus, in accelerating a body, the resultant force F’’* performs 
the work 
A= F;* & AS= ma, AS 
where a, is the average tangential acceleration over the portion of 
path AS being considered. Substituting for a,, we obtain 
A= mares AS 


where uv is the average velocity which is equal to */, (v,+0,). 
Here v, and v, are the instantaneous velocities at the’ beginning 
and the end of the path anaes Since Av=v,—v,, then * 


2 
mv, mv oe 
ae = ; 


5 ue work is numerically equal to the increment in the value 
wg me . Therefore, the quantity 


= mu xX Av, 


_ mvt 
me 


is employed as a measure of the energy of motion of a particle. 
This quantity K will be called kinetic energy. The previous equation 
may now be read as follows: The work of the resultant force acting 
on a body (i.e., the product of the tangential component of the 
resultant force and the traversed distance) is equal to the increment 
in the kinetic energy of the body. This equation is convenient for 
the solution of elementary mechanical problems in which the path 
along which the force acts is given. 


* The same result is obtained if we write the expression for an infinttely 
small quantity of work in the form dA=mvudv and integrate it from the 
moment the velocity was v, to the moment it reaches v,: 


Va 2 2 


mo, mo, 
A= mo dv=—-—z 


e 
1 
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We shall repeatedly be dealing with the term “energy”. It is one of the 
most important physical concepts. Energy, i.e., work capacity, is a 
function of the state of a body. Work is produced at the expense of a 
decrease in the value of this function. Kinetic energy is a function of the 
state of motion. If the kinetic energy changes from K, to K,, then the 
work performed thereby is equal to K,—K,, independent of the 
nature of the motion. It is of no importance whether the velocity 
changed rapidly or slowly, uniformly or nonuniformly. The de- 
crease in the kinetic energy by a specific amount always yields the 
same amount of work. 

Only in the case when the physical quantity is a function of 
state can it have the sense of energy, i.e., a store of work. 


Examples. A unit of energy in atomic physics is the electron-volt (ev). This 
: ue kinetic energy. of an electron accelerated through a potential difference of 
volt: 


lev=1.6 X 107! erg. 


The energy of a proton accelerated in a synchrotron is 10 Bev=10'° ev=0.016 erg. 
The kinetic energy of a large jet passenger plane (m=100 tons and 
v=800 km/hr) is 


2.5% 10" ergs=2.6 X 10° joules =2.5 X< 108 kg-tn. 


11. Potential Energy 


Let us consider several phenomena in which the performed work 
is not accompanied by a change in the velocity of the body. We 
shall be concerned with two types of problems. The first is related 
to the elastic deformation of a body, while the second deals with 
events occurring during the motion of a body in a gravitational or 
electric field. We shall presently show that in both cases we shall 
be dealing with the transformation of work into a special variety 
of energy, so-called potential energy. 

Elastic deformation phenomena will be treated first. Experiments 
show that for any elastic deformation—extension, compression, 
flexure, etc.—one can always find a function of state thal increases 
precisely by the magnitude of the work performed on the body. 
This function of state or, in other words, the function of the body’s 
properties and degree of deformation is called the potential energy 
of elasticity. 

We shall show this energy to exist for a case of elastic deforma- 
tion, namely, linear extension or compression. Analogous examples 
could be given for any other kind of elastic deformation. 

Let some force, such as a muscular force, stretch a solid body, 
e.g., a spring, very slowly. The work expended in stretching the 
body from length /+s, to length /+-s,, where / is the length of 
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thé unstretched. spring, is 
A=F(s,—s,). 


The muscular force is balanced by the elastic force of the spring 


at every given moment. For a small extension of the spring, the 
latter is proportional to the deformation s*:° 


F,,= ks. 
In the eeu for work, we must use the average value of 
the force F, i.e., > uae then obtain **: 


, the wore goals the elastic force is de in increasing 
the quantity 2 >. This quantity is, therefore, adopted as a measure 
of the elastic energy. The quantity 


ks? 
Un=> 
will be called elastic potential energy. 

Elastic potential energy formulas 1or other kinds of deformation 
have exactly the same form. The body’s stillness with respect to a 
specific form of deformation is characterised by k, while s is a 
measure of the deformation (for example, twist angle, displacement 
angle, etc.). 

The quantity U,, is energy in precisely the sense referred to at 
the end of Sec. 10. Irrespective of the manner in which a body is 
deformed and the rapidity of the process, the same ‘amount of 
expended work will always correspond to one and the same incre- 


mental value of the quantity ae Thus, ks’ is a measure of energy 


2 
or, to be more precise, of elastic potential energy. 


Examples. 1. The potential energy of a piece of steel wire (Young’s modulus 
E= 21,000 kg/mm*) having a length of 50 metres and a cross- “section of 10 mm’, 
and which is stretched | cm, is 


Ya = = = 20 & 10° ergs +2 joules + 0.2 kg-m, 


* It should be recalled that the law of elastic deformation (Hooke’s*law) 
is written in the form Fak, where E is the modulus of elasticity and S 1s 
the cross-section .of the stretched body. Thus, the stiffness (the constant of pro- 
portionality in the expression for the elastic force) has the value k=. 


** The same result is obtained when we integrate the infinitely small amount 
of work dA==—ksds between the limits s, and s,, 


3—3580 


60 Mechanical Energy [Chap. IT 


2. For rubber, Young’s modulus E=8 kg/em® A stone having a mass of 
20 gm is shot from a slingshot to a”height of 20 metres. This requires that an 
energy of 0.4 kg-m be imparted to the stone. Assume that the elastic band has 
an initial length of 40 cm and stretches an additional 40 cm. Let us determine 
the required cross-section for the elastic band, 


ES s® 21U 2 40(cm) X40 (kg-cm ie 
Us=F Fh Sm ES (ie) ea =<) 0.25 cm?, 
cm? xi, 0 (cm?) 





Gravitational force possesses the same feature (discussed above) 
as elastic force. Thus, work expended in lifting a body in a gravi- 
tational field serves to change the body’s function of state. In this 
case, the function interesting us depends on the position of the 
given body with respect to the bodies attracting it, This function 
is called gravitational potential energy. | 

We shall show this energy to exist, first, for a body located 
close to the Earth’s surface. From point 1, the body is moved to 
the higher point 2 along some curvilinear path. Let us divide this 
path into small segments, replacing the curved line by a broken 
line. The latter can be made to approximate the former to any 
desired accuracy. The work expended in moving a body along one 
of these linear segments of length d/ is then 


dA=mgdlsina or dA=mgdh, 


where dh is the increase in height. Since mg does not change along 
the entire path of motion, we can place it before the brackets (be- 
fore the integral sign when integrating) in writing the expression 
for the work expended along the entire path: 


A=mg (A, —h,), 


where A, and h, are the heights of points 1 and 2 respectively. 
Furthermore, 


ie., the work of displacement is equal to the increase in the prod- 
uct mgh, which is a measure of the gravitational potential energy 
for this simple case. 
It is quite evident that 
U=mgh 


is energy and is in complete accord with the meaning we have 
assigned to this term. Irrespective of the manner in which the work 
is performed, i.e., the path taken by a body and the speed of the 
motion, the work of displacing the body from point 1 to point 
2 will always be the same, since the increase of energy depends 
only on the location of these points—in our simple case, on their 
Heights. 
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Since the work of displacing a body in a gravitational field does 
not depend on the path, the work of displacement along a closed 
curve will be equal to zero. 

It should be noted that it is immaterial what level we choose 
as our base for A. If it is agreed to calculate A from the Earth’s 
surface, the potential energy of a body at the bottom of a well 
will be negative. 

The above formula is not valid for bodies that are far from the 
Earth, e.g., the Moon. Thus, as was explained in Sec. 2, mg, the 
approximate expression for the gravitational force, should be replaced 


for large distances by the exact expression yor. 

Let us calculate the work done by the gravitational force. Work 
performed by the forces of a system will be considered positive, 
while work against the forces of the system will be considered 
negative. Let us assume that two attracting bodies draw together 
along the line of action of the forces over an infinitely small segment 
—dr of the path (minus, since r decreases). Thus, 





dA=—y dr. 
But a= d(—). Therefore, 
dAs=—a (—y mt 


Work takes place at the expense of a decrease in the value of 
mM, 





U=—y nae which is a measure of the gravitational energy in the 
general case: 
dA=— du, 
The quantity 
U=—yo 


c 


represents gravitational .potential energy in the general case. 

U is equal to zero if the bodies are infinitely far apart. When 
the bodies draw together, U increases in absolute value. But since U 
is negative, we see that, just as with the approximate formula for 
bodies close to the Earth, the potential energy is less the cleser the 
attracting bodies are to each other. Naturally, if we desired, we 
could change the base line for U and make this quantity positive 
in the interval of values concerning us. 

It is not difficult to show the relationship between the general 
formula for U and its particular case when U =mgh. Thus, replacing 


3* 
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r by R+4A, where R is the radius of the Earth, we obtain 
1 





ya wim RN 
RFR 
R 


(M is the mass of the Earth). But since 4 is a small quantity, 


we can write with sufficient accuracy — =| 7 , whence 
R 
U=—y um ++ mgh. 


Changing the base line for U so that zero potential energy for 
a body is at the Earth’s surface, the formula reduces to U=mgh. 


Example. To obtain a clearer picture of the meaning of the above results, 
let us calculate the potential energy of a body of mass m=1 kg at the Earth’s 
surface and at a distance of 1,000 km above its surface. 

The potential energy at the surface of the Earth is 


Mm 1 5.8X1027x 108 
UN eS Tagite se. sxior OS CR 


The potential energy at a height of 1,000 km is 
1 5.8 1027x108 
U seg TDx10° 7.3x105 ==— 5.3104 ergs. 


From the caiculations, it is evident that 1) the potential energy of a body, 
in the Earth’s gravitational field is always negative and increases with its 
distance from the Earth (since we have agreed that it tends to zero when 
h—> oo); 2) the change in the potential energy of a body rising above the 
Earth’s surface is, generally speaking, not described by the expression 
mg(h,—h,). Thus, 


Uy goo Uo=— 5.3 10" — (— 6.1 x 10") = 0.8x 10" ergs = 816,000 kg-m, 


while the calculation with the cxpression mg(h,—A,) yields 1,000,000 kg-m. 
However, when we are concerned with ascensions to a height h<R (R is the 
radius of the Earth), it is permissible to use the simplified expression mg (hz—h,). 


The formula for the potential energy of the electrical interaction 
of charges is very similar to that for gravitational potential energy. 
Let us consider two electrical charges q, and q, having the same 
sign and separated by a distance r. According to Coulomb’s law, 
the particles will repel each other. Therefore, in reducing their 
separation to the small distance dr, we perform work equal to 


—dA=—4P dr (the minus sign is used in the left-hand member 


because the work is performed against the forces of the system; 
the right-hand member also has a minus sign because the distance 
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is decreasing and dr is negative). The calculation, which in no 
way differs from that for gravitational force, yields for the energy 
of electrical interaction of a (for brevity called Coulomb 


energy) the expression U=oe, j i.e., here too dA=—dU. 


The interaction energy of cee dente opposite signs is nega- 
tive and behaves like gravitational energy. The interaction energy 
of charges having the same sign is equal to zero when the charges 
are separated by an infinite distance; it increases as the charges 
are brought together. 

We shall restrict ourselves to these examples of potential energy, 
although in various cases other functions of state of a body may 
be introduced. 

Potential energy always exists when forces act between bodies 
or particles of the system under consideration that depend on the 
distance between the bodies. Potential energy is the interaction 
energy of the bodies. If a system consists: of a number of bodies 
or particles, we can then speak of its total potential energy, i.e., 
the interaction energy between all the particles (each with all the 
rest). Thus, in the case of four particles, the potential energy is 
composed. of six. terms, for we must consider the interaction be- 
tween the first body and the second, third and fourth; the second 
body and the third and fourth; and finally the third body and 
the fourth. 

In mechanics, only the potential energy of forces acting between 
different bodies is considered. If a body is complex ‘and consists 
of many particles, the interaction potential energy of these parti- 
cles is considered to remain unchanged during the mechanical proc- 
esses. The interaction. potential energy of the particles comprising 
the body is a component part of the body’s internal energy 
(Chapter IX). If changes in a body’s internal energy take place, 
the phenomena must be considered in the light o the laws of 
een (Chapter IX). 


12. Law of Conservation of Mechanical Energy 


“Irrespective of the type of forces involved in the motion, the 
work of the resultant force is always equal to the increment of 
the body’s kinetic energy, i.e., 

mv? 
| FAs=A A (3). | 
The forces acting on the body could be elastic forces, gravitational, 
electrical and frictional forces, etc. 

-It is always possible to separate from. the effective forces those 
whose work serve to change the potential energy. For brevity, 
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such forces are sometimes called potential forces or forces possess- 
ing potential. The work equation may be written in the form 
Frot As +-fAs= A (>). 

Here, f represents the nonpotential forces. The work of these 
forces is equal to the change in the internal energy of a body or 
the medium in which the body moves. 

Substituting in place of the work of the potential forces the 
increment of potential energy with reversed sign, we can write 
the equation in the form 


fAs=A (F +U) . 


The sum of a body’s potential and kinetic energy is called total 
mechanical energy. Designating this quantity by @, we obtain: 
fAs=A6, i.e., the change in a body’s total energy is equal to 
the work of the nonpotential forces, e.g., the frictional forces. 

If the work performed in changing the body’s internal energy 
is small with respect to @, the equation simply reduces to the 
following: AG=0 and @==const. This is the law of conservation 
of mechanical energy, which states that the total mechanical 
energy of a body is conserved. 

This law may be easily generalised for a system consisting of 
many bodies or particles. For each body we may write a work 
equation and then combine these equations into one. The total 
energy will then be equal to the sum of the kinetic energies of 
the bodies and the potential energy of interaction: 

m,v% 


Ba gp et. 


If all the interacting bodies are taken into account (such a system 
of bodies is called a closed system), the form of the law remains 
the same as for a single body. The change in mechanical energy 
is equal to the work of the nonpotential forces and, if this work 
is negligible, the total mechanical energy of a closed system of 
bodies remains unchanged, i.e., is conserved. 

The law of conservation of mechanical energy is, on the one 
hand, a consequence of the equations of mechanics (Newton’s law); 
on the other hand, it may be considered as a special case of a 
more general law of nature—the law of conservation of energy 
(Chapter IX). 

Even in mechanics alone, many forms of interconvertible energy 
are met. In considering the motion of a body under the action 
of elastic -forces or gravitafional force, it is easily seen that an 
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increase in the energy of one of the mechanical forms is accom- 
panied by a decrease in the energy of the other form. 

Thus, the gravitational force acting on a falling body decreases 
the potential energy of the body and increases its kinetic energy. 
The reverse is true when a body is lifted to acertain height. The 
elastic force making a ball thrown against a wall rebound decreases 

the potential energy of the compressed ball and transforms it 
into kinetic energy. The reverse takes place when the wall stops 
the thrown ball (the interval from no deformation to maximum 
compression). 

A stretched spring can raise a load to a certain height. On the 
other hand, a falling load can stretch a spring. Thus, elastic 
energy can be transformed into gravitational energy and vice versa. 

The above examples apply to the transformation of one form of 
energy into another in one and the same body as well as to the 
transfer of energy from one body to another. 

It is possible, of course, to transfer energy in the same form 
from one body to another: one load pulls another by means of a 
pulley, a sphere colliding with another transfers part of its kinetic 
energy to it, etc. ’ 


13. Potential Curves. Equilibrium 


The potential energy of interaction of bodies or particles depends 
on their relative distribution, i.e., it is always a function of the 
coordinates or other parameters describing the location of these 
bodies in space. In the simplest cases, the potential energy may 
depend on a single coordinate. 

Let us consider the interaction of two particles whose potential 
energy of interaction is described by the function U(x), where x 
is the distance between the particles. For the sake of definiteness, 
let us assume the particles repel each other with a force F. 
Under the action of the interaction forces, the distance between 
them increases by dx, i.e., an amount of work equal to Fdx is 
performed. This is possible at the expense of the potential energy 
of interaction U, which changes by —dU (decrease of energy). 


Thus, —dU=Fdx or 
dU 
F=o—7, 


i.e., in the case of potential forces the force is equal to minus 
the derivative of the potential energy with respect to x. The nature 
of the mechanical problem is then very simple and is clearly 
described by so-called potential curves, i.e., graphs on which the 
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values of the potential energy are plotted as a function of the 
parameter (Fig. 17). 

In explaining the essence of this graphical method, the analogy 
is usually drawn with the motion of a body on a hill. The mean- 
ing of the potential curve then becomes particularly clear,. for 
the profile of a hill and a potential energy distribution curve that 
is proportional to the height 
A of the hill coincide if drawn 
to proper scale. 

Potential curves consist of 
crests and troughs, steep and 
gradual rising slopes as well 
as steep and gradual declining 
slopes. The form of the curve 
permits us to immediately in- 
dicate on which portions of 
the path a large amount of 
work is performed, on which 
a small amount, and whether 





Forbidden region for . = the work is positive or nega- 
the total energy E> tive in each case. The steeper 
the potential curve, the larg- 

Fig. 17 er the force acting on the 


body. In accordance with the 
familiar geometric sense of the derivative, force is described by 
the tangent of the angle of inclination of the tangential line to 
the potential curve. 

The validity of the formula relating potential energy and force 
is completely evident for those particular cases of potential energy 
that we. have considered. For the potential. energy of a body on 
the Earth’s surface: 





U=mgh and F=— 7 = — mg. 
Fer a body in a gravitational field, in the general case: 
=—y a and Fa— Gay Be. 


For a body subjected to elastic action: 


x? dU 
ua and F=— = — ke, 


For electrical interaction: 
U = its and Faw 
ta 
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- Returning to the potential curve plotted in the diagram and 
keeping the above explanation in mind, we can immediately indi- 
cate where the force is greatest and the points where the force 
acting on the body is equal to zero. The latter points, i.e., the 
positions of equilibrium, are at the bottom of the potential well 
and at the potential peak. The positions where the potentia! 
energy is a maximum correspond to unstable equilibrium, while 
the bottom of the potential well is a position of stable equilibrium. 

We stated above that the form of the potential curve permits . 
us to describe the possible motion of the body. This is not com- 
pletely accurate. In addition to the potential curve, we must also 
know the value of the total mechanical energy of the body. If 
this value is known, we can then indeed deduce from the form of 
the potential curve the possible motion of the body or particle. 

Horizontal lines are drawn in Fig. 17 at the ordinates corre- 
sponding to @, and 6,. If @ is the total energy of the particle, 
then we can determine the kinetic energy as well as the potential 
energy from the curve. The former is the difference between @ 
and U 

The moving particle cannot occupy positions in which the 
potential energy is greater than the total energy. Thus, the hori- 
zontal line @ restricts the possible motion of the body to certain 
portions of the curve. In the case when the energy is represented 
by the lower line @,, the moving point has two possible intervals 
in which it may be located. It may be either in the potential 
well (and have an oscillatory motion there) or on the slope to the 
right of point A, where it will move downwards or upwards 
depending on whether it acquires or loses kinetic energy. 

The above analysis is completely valid for any kind of poten- 
tial curve. Fig. 18 shows several types of such curves. Thus, in 
Fig. 18a, we see the potential curve for a body oscillating on a 


UG) 
U(x) UG) 





a@) 6) 
Fig. 18 
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spring. The oscillating body is in a potential well having symmet- 
rical edges. In Fig. 186, a potential curve is shown that is typical 
for many interacting particles, e.g., atoms and molecules. The 
curve constitutes a potential well, one of whose edges has a very 
steep slope while the other has a gradual one. Plotted along the 
abscissa is the distance between the particles. As can be seen from 
the curve, the potential energy is very large at small distances, 
falls with increasing distance, reaches a minimum, then gradually 
rises tending toward some finite limit. The nature of the motion 
and the bond between two interacting particles are completely 
described by this curve. Two cases should be distinguished. 
The first is when the total mechanical energy of this pair of 
particles is represented by the lower horizontal line @,. The 
second is when the total energy is equal to @,. In the first case, 
the system cannot get out of the potential well. This means that 
the distance between the particles lies between the limits indicated 
in the figure. The mutual motion of the particles can only be of 
an oscillatory character. Such is the situation in a stable diatomic 
molecule. The second case is the reverse of the first. The total 
energy of interaction of the particles is too large for them to be 
constantly linked. The system may get out of the potential well, 
i.e., the bond between the particles cannot exist and the particles 
may fly apart to any distance whatsoever. 

The third potential curve in the figure is a so-called square 
well. Recalling that force is described by the tangent of the angle 
of inclination of the tangential line to the potential curve, we see 
that the potential energy may be represented in the form of a 
square well if the body or particle moves freely without the action 
of forces, yet cannot leave the bounds of the given portion of the 
curve as long as the total energy is less than the height of the 
sides of the well. 


CHAPTER III 
MOMENTUM 


14. Conservation of Momentum 


The product of the mass of a body, or particle, and its velocity 
is known as the body’s momentum (quantity of motion): p=mv. 
The momentum p is thus a vector quantity. In a system of bodies 
or particles, the momentum is equal to the vector sum of the par- 
ticles constituting the system: 


P=p,+Pp.t 


What makes this vector quantity of particular interest to the 
physicist is the fact that in a closed system the vector P does not 
change, irrespective of the motion within the system itself. This 
proposition is known as the law of conservation of momentum. 

The law of conservation of momentum follows directly from New- 
ton’s laws. For each of the bodies in the closed system, the follow- 
ing equation is valid: 


d 
i.e., 
dp 
Ce 


Let us consider what happens when we write such an equation 
for each of the bodies and then add the equations. The right-hand 
member of each equation represents the forces exerted on the given 
body by all the other bodies. Thus, the fcrce exerted on the first 
body is equal to the sum of the forces exerted on it by the second, 
third, etc. Using double indexes, we may write: F,, +F,,+ F,,+ -. 
Similarly, for the forces exerted on the second body, we may write: 
F,,+F,.+F.,+ +3 for the third: F,,+F,,+Fs,+ -..; ete. It 
is not difficult to see that when the right-hand members of the equa- 
tions are added the result is zero. For each term in the first line, 
there is always a term in another line that is equal and oppogite 
to it (in accordance with the law of action and reaction). Thus, 
when F,, and F,, are added the result is zero; also F,, and F,,; 
etc. Therefore, in a closed system, the following equation holds: 


dp, , ap, dp; d 
Pep Oe 0; (PtP test) =0 
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or 
Pi+P.+P; t+: - = const. 


This j is the law of conservation of momentum. The magnitude and 
direction of individual moments may change, but their vector sum 
in a closed system does not. 


Magnitudes of some momenta: The momentum of anelectron with an energy of 
5 ev is ~ 12x 107 20EB XM, that of a rifle bullet is~8X 1057 A 8 Kernels 


sec 
10? kgxmetres 
sec 





and that of a freight train is ~ 


15. Centre of Mass 


The methods of finding the centre of gravity of a body are well 
known. If a body is fixed at its centre of gravity, it is in a state of 
neutral equilibrium. For a system of particles, or a solid body con- 
sidered to be broken up into elementary elements having the size of 
.particles, we can write an analytical expression for the position of 
the centre of gravity. 

Using .the rule for adding parallel forces (Fig. 19), we obtain the 
following expression for the position of the centre of gravity when 
i the particles are considered to 
be distributed along a straight 
line, say the x-axis: 


or MX, EM Xp + MMgXq 
m+-m,+img+... 7” 


Here, x,, x,, X, ...are the coor- 

dinates of the Sg ee and 

X ————+} (m+ + m,, m,, m, ...are the masses. 

(moma *™)9 Masses. are used’ instead of 

weights since the acceleration 

Fig. 19 of the gravitational force can- 
cels out. 

It is shown in theoretical mechanics that for any distribution 

of particles the expression for the position of the centre of gravity 

has the form: 





Ra tale Fare T see 
my +m, -+ ms+ .. : 


where R is the radius vector of the centre and r,, r,, r, are the 
radius vectors of the particles. 

Since the acceleration of the gravitational force cancelled out in 
these formulas, we can conclude that the point found has an 
objective significance that does not depend on the gravitational 
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conditions. It is valid, in fact, even if the body is located in 
interplanetary space under conditions of weightlessness. It makes 
sense, therefore, to replace the prevalent designation “centre of grav- 
ity” by a designation that more accurately expresses the essence 
of the matter. Thus, we speak of the centre of inertia or centre . 
of mass of a body instead of its centre of gravity. 

We shall directly see the full significance of this designation. Let 
us consider the velocity of motion of the centre of mass: 


dR 
V=7- 


Using the formula for the determination of the centre of mass, 
we obtain: 
Ve Ort Mey T MyVy toe 
m+ mM,+ms+ . 





In the numerator we have the total momentum, which is conserved 
‘in a closed system. Thus, the right-hand member of the equation 
is equal to a constant quantity. We can conclude, therefore, that 
the velocity of the centre of mass does not change in magnitude 
or direction. Or, in other words, the centre of mass of a closed 
system of particles executes inertial motion. 

As we already know, all inertial systems of coordinates have equal 
validity. Hence, we can always go over to a coordinate system 
bound to the centre of mass of the system under investigation and 
consider this interesting point as fixed. In atomic physics, we often 
consider collisions between particles. To study this phenomenon, two 
systems of coordinates are used—the laboratory system (the natural 
coordinate system of an observer) and the system bound to the cen- 
tre of mass of the colliding particles. The advantage of the latter 
frame of reference is evident: the total momentum of the particles 
is equal to zero. 


16. Collisions 


The word “collision” should be understood in a somewhat broad- 
er sense than that used in everyday practice. For the mechan- 
ical problems that now concern us, any encounter between two 
or more bodies in which the interaction is of short duration will 
be considered to be a collision. Thus, in addition to the phenomena 
that can be classified as collisions in the usual sense of the word— 
e.g., impact of billiard balls and collisions between atoms and 
atomic nuclei—we have such events as a man jumping on or off 
a street car and a bullet hitting a wall. The forces arising as the 
result of such short interactions are so great that the role of all 
constant forces being exerted is negligible. As a result we are 
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justified in considering the colliding bodies as a closed system and 
we can apply the law of conservation of momentum to them. 

In many collisions, the duration of the interaction is measured 
in thousandths of a second. During this interval of time, the force 
p rises to its maximum value and then drops 
to zero. A typical curve for the force dur- 
ing such an impact is shown in Fig. 20. 
For each instant of time during the impact, 
the relationship between the force exerted 
on either of the bodies and the momentum 
i this body is given by Newton’s second 
aw: 


d 
qi (mu) =F. 


! Rewriting this equation in the form FAt= 

= A(mv), we can say that the product of 

Fig. 20 the average value of the force- and the dur- 

ation of its action is equal to the change 

in momentum. A more accurate description of the phenomenon is 
obtained if we integrate the above equation from the initial instant 
of impact to the termination of the interaction. It is evident that 


t 


{ Fdt = (mv), — (mv),. 


t) 


The integral on the left is sometimes called the impulse of the 
force. In the diagram, this quantity is represented geometrically 
by the area under the impact curve (see Fig. 20). 

There is considerable variation in the nature of collisions, de- 
pending on the elastic properties of the bodies. It is customary to 
consider two’ extreme cases—ideally elastic and absolutely non- 
elastic impacts. 

First, let us consider the latter type. A nonelastic impact 
means an encounter between two bodies whereby these two bodies 
become joined. Examples of nonelastic impacts are collisions be- 
tween clay spheres, a man jumping onto a moving trolley, the 
collision between oppositely charged ions resulting in the formation 
of a molecule, and the capture of an electron by a positive ion. 

Assume that the bodies moved with velocities v, and v, before 
the encounter. Thus, the total momentum was m,v,-+m,v,. After 
the encounter the bodies have a common mass equal to m,+m, 
and move with some velocity V. The momentum of the system 
after the encounter is (m,-+m,)V. Since the law of conservation 
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of momentum requires that 
(m,-+m,) V=m,0,+m,2,, 


{he velocity of the bodies after the nonelastic impact is given by 
the formula: 
__ MD, + M,0, 
v= mytm, * 


The momentum after the encounter should equal the sum of the 
momenta before impact. 

If the motion of the bodies colliding head-on is along a straight 
line, then after impact the bodies will follow the direction of the 
body having the originally larger momentum. If the momenta of 
the bodies are equal in magnitude, m,v,=—m,v,; V is thus equal 
to zero, i.e., the colliding bodies come to a standstill. 

A nonelastic impact is accompanied by a transformation of 
energy. From the example just given, it is seen that the kinetic 
energy may feven become zero. It is not difficult to calculate the 
increase in the internal energy of colliding bodies in one or an- 
other case. All that we need do is perform the following subtraction: 





matity ya (ih, atl 
mui ys (A). 

Let us now consider ideally elastic collisions, i.e., colli- 
sions in which the form of the bodies is completely re- 
stored. This means that no changes occur in the state of these 
bodies, their potential and internal energy before and after impact 
remain unchanged and, consequently, the kinetic energy is con- 
served. For two bodies colliding in this manner, two equations can 
be written that are based on the law of conservation of momentum 
and the law of conservation of kinetic energy. Let us designate 
the masses of the bodies by m and M. We can always make the 
origin of the coordinate system coincide with the position of one 
of the bodies. This simplifies the problem without in any way 
making it less general. Let us assume, therefore, that the body 
having mass M is at rest before impact. The above laws of con- 
servation then yield the following two equations: 


ma=mo-+MV and 5 mu? = mo* ++-MV*. 


Here, « and @ are the velocities of sphere m before ands after 
impact, and V is the velocity of sphere M after impact. 

Let us consider several examples using these equations. First, 
we shall examine the case of noncentral* collision of two spheres 


* The impact is classified as central if the motion of the spheres before 
impact occurred along a straight line passing through the centres of the spheres, 
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having equal mass (Fig. 21). The tnasses cancel out in both equations 
and we obtain 
u=04-V and uw?=vt 2 V?, 


From the vector equation, it is clear that the vector a closes the 
triangle formed by vectors v and V. The equation on the right 
shows that the triangle, for which a is the hypotenuse, must be 
a right triangle. Hence, it follows that 
=p ees oe the velocities, after the collision of two 
nar particles having equal mass, must be di- 
rected at right angles to each otner. This 
interesting conclusion is easily verified in 
billiards, where the directions of miotion 
of the object ball and the cue ball form 
an angle of 90°. In otuer respects, the 
nature of the velocity change is not 
determined by our equations, for they 
do not take into cousideration the de- 
viation of the line of inipact from: the 
lig. 21 line passing through the centres of the 
spheres. 
A complete description of the motion of the spheres after impact 
is obtained if we restrict ourselves to the case of central impact. 
The motion of the colliding spheres will then be along the same 
straight line after impact as before impact. We can dispense, there- 
fore, with the vector notation. keeping in mind, however, that 
a change in the velocity’s sign meats that the direction of motion 
has changed. In this case, there is no need for making the simpli- 
lying assumption of equal masses. The equations for central colli- 
sion have the form 


mu=mvo+MV and mu? =mv’-- MV*. 





Rearranging terms, these equations can be written in the form 
m(u—v)=MV and mu* —v?)= MV? 


Dividitig the latter by the former, we obtain: a--u- Voor 
u-=—(u-- V). Note that the relative velocity of motion of sphere 
m with respect to sphere M before impact (designated by wu) is 
equal in magnitude to the same relative velocity after impact. 

An interesting formula is obtained when we substitute Ve=u +o 
in the formula for the law of conservation of momeuicn We obtain 
an expression for the velocity of sphere sa after impact in terms 
of the velocity of this sphere before impact: 
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unchanged, since no tangential adhesion forces are exerted by the 
wall. As can be seen from the figure, the increment of the mo- 
mentum is numerically equal to 2musina and is directed along 
the normal to the wall. According to the fun- 
damental law of mechanics, at the instant of 
impact, the force exerted on the sphere by the 
wall has the same direction as that of the vec- 
tor of momentum change. The angle of inci- 
dence of the sphere is, therefore, equal to its 
angle of reflection. 

Let us consider an inelastic impact, using as our 
example a ballistic pendulum (a device for measuring 
the velocity of a bullet). A bob containing sand, mass 
M, hangs from a line. The bullet is fired into the 
bob and becomes imbedded in the sand. The momentum 
. of the bullet before impact is mu, and the momentum 
Fig. 24 of the system after impact is (M-—+m)v. Hence, 





v= u. 


m 
m+M 
Acquiring the kinetic energy ue the bob expendsitin rising to the height h,« 
determined by the following condition: 


Mgh="P; ie, h=g (qi) (m@<M). 


If M=10 kg, m=10 gm and u=900 metres/sec, then h=4 cm. 

If we had not used the law of conservation of momentum in determining 
h, but had assumed instead that the total kinetic energy of the bullet had been 
transformed into potential energy of the pendulum, we would have obtained 
the value h=40 metres (!). This means that, in our example, 399.6 kg-m of 
mechanical energy, or 99.9% of the total supply, has “disappeared”, i.e., gone 
to heat the system. Since absolutely .elastic bodies do not exist, mechanical 
energy is not conserved for “elastic” impacts as well, for part of it is trans- 
formed into energy of thermal molecular motisn and is dissipated. We shall return 
to this example in Chapter XI (p. 179). 


Using an example involving collision we shall now illustrate 
the merit of a coordinate system bound to the centre of mass. 
Assume that a sphere of mass m, at rest in a laboratory coor- 
dinate system, is hit by a similar sphere with velocity v. If the 


impact is inelastic, some portion of eae the kinetic energy of the 


system, is transformed into heat. In other coordinate systems, the 
kinetic energy of this pair of spheres is expressed by other quan- 
tities. As regards the heat released, it will be the same for the 
given pair of spheres and is simply determined by the velocity 
of their relative motion. Therefore, instead of resorting to the law 
of conservation of momentum to try to determine the portion of 
the kinetic energy that is transformed into heat, calculated for 
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the laboratory coordinate system, it is sufficient to calculate the 
kinetic energy for a coordinate system bound to the centre of mass. 
Since in such a coordinate system the total mcmentum of bodies 
is equal to zero, after an inelastic collision the spheres come to 
a standstill: all the kinetic energy is transformed into heat. The 
kinetic energy will have a minimal value for a system bound to 
the centre of mass. 

In a coordinate system bound to the centre of mass, the spheres 


move toward each other with velocities 4-v, The kinetic energy 


2 
of each sphere is thus equal to <- mo and the total energy of 
the system is + mo". This is the amount of heat released during 


an inelastic collision. Irrespective of the type of impact, the heat 
(or other form of energy) released due to the kinetic energy of the 
bodies cannot exceed the amount of kinetic energy calculated for 
a system bound to the centre of mass. And conversely, in order 
to release a given amount of heat, it is necessary to have the 
equivalent amount of kinetic energy calculated for a centre of mass 
system. 

Example. A nuclear reaction in which a-particles bombard nitrogen N"™ 
fakes place in accordance with the following equation: 

N#*+ He*—, 0"? + H4, 
The energy absorbed in this process amounts to 1.13 Mev. How much Kinetic 
energy in a laboratory system must an a-particle possess in order for the 
reaction to proceed? At first glance, it seems that 1.13 Mev is sufficient for 
this purpose. But we already know that this is not the case. In a centre of mass 
coordinate system, 1.13 Mev is required, but in a laboratory cocrdinate system, 
more energy is needed. 

Thus, the velocity of the centre of mass is 0. Oa where m,%, is the 
momentum of the first particle and m,v, is the momentum of the second. The 
velocity of the first particle in a centre of mass coordinate system is v7, = v9, — 
— 00> 5m; (1 D2): For the second particle, we may write: v,= 0, — 0, 
1 2 


= Gm, (2M). Hence, the kinetic energy of the system (a, N™) in a centre 


of mass coordinate system is Kem= 5! (01-22), where pag is the 
so-called reduced mass of both particles. We shall consider the nuclei of N™ 
fixed (v,=0). This assumption is justified, since we can always neglect the 
slow thermal motion of the target nuclei as compared with the large velocity 
of the bombarding particles. The kinetic energy in the laboratory coordingte 


system is then Kb= mv; and therefore 


m,+img 
Ktap= Kem ae B 


. The reaction proceeds if Kzm,==1.13 Mev. Since m,=4 and m,=14, we 
obtain 





Kyqp = 1.13 X= 1.45 Mev. 
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17. Recoil 


The law of conservation of momentum helps one to easily un- 
derstand the fundamentals of recoil in gunfire, reaction propulsion, 
and other similar phenomena. 

We shall consider, in the first place, recoil taking place in a frame 
of reference where the bodies are at rest at the initial moment, 
In the case of gunfire, this assumption is in complete accord with 
the prevailing conditions. If at the initial moment a system con- 
sisting of two or more bodies is at rest, the total momentum of 
the system is equal to zero. Irrespective of the future course of 
events, the total momentum continues to be equal to zero. Thus, 
if at some instant an explosion takes place, causing the system to 
be divided into parts having masses m,, m,, m,,..., which fly 
asunder with velocities v,, ¥,, 0,,..., the total momentum m,9, -+ 
+m,0,-+m,0,-+... of the scattered bodies must be, as before, equal 
to zero. 4 

In the case of gunfire (where the system divides into two parts), 
the condition that the momentum of this system of two bodies be 
equal to zero has the form mv+-MV=0. Here, the lower-case let- 
ters refer to one body, say the missile, and the capital letters to 
the other—the gun. The division of the system into two parts can 
only take place along a straight line. We can, therefore, dispense 
with the vector notation and write the condition in the form 
mu=— MV. The velocities of the gun and the missile are inversely 
proportional to their masses. Thus, the greater the mass of the 
missile with respect to the mass of the gun, the greater the ob- 
served recoil. 

The phenomenon of “continuous recoil”, occurring in reaction pro- 
pulsion, is of exceptional interest. It.is the subject of a distinctive 
branch of mechanics that may be called the mechanics of variable 
mass. This phenomenon does not only occur in jet planes. Indeed, 
we can point to a number of commonplace occurrences involving 
such motion. As examples, it is sufficient to mention the case of 
an uncoiling roll of paper or the fall of droplets continuously con- 
densing in the atmosphere (see the example at the end of this sec- 
tion). The fundamentals of the mechanics of variable mass were 
developed at the end of the nineteenth century by Prof. I. V. Mesh- 
chersky. Since we cannot describe his work here, we shall restrict 
ourselves to the consideration of a single problem in this field—a 
problem related to the possible velocity of motion of a rocket. 

A rocket moves with a velocity v and at some instant ejects a 
certain amount of combustible gas having mass dM. The mass of 
the rocket, naturally, decreases by this amount. If the velocity of 
the ejected gas is designated by u (this velocity is not given with 
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respect to the rocket, but with respect to the inertial coordinate 
system in which the velocity of the rocket motion is described), 
the momentum of the matter escaping from the rocket will be equal 
to udM. The rocket decreases its mass and increases its velocity by 
the amount dv. The momentum of the rocket after ejecting the fuel 
is equal to (M—dM)(v-+dv). In accordance with the law of con- 
servation of momentum, we can equate the momentum Mov of the 
rocket before discarding a portion of the fuel and the momentum 
of the system after that quantity of gas has been ejected. The lat- 
fer is equal to the difference between the momentum of the rocket 
and the momentum of the portion of fuel. Thus, 


Mu = (M—dM) (uv + dv)—udM. 


Whence, excluding second-order infinitesimals, 


dv=(u+o)%, 


But u-+v is the relative velocity of the outflowing combustible 
gas (with respect to the rocket). Designating this velocity by c, we 
arrive at the following equation for the increment in the rocket’s 


velocity: du=—cSt The minus sign is used to show that the 


velocity increases when the mass decreases. It can be seen that the 
“increase in velocity is equal to the fraction of the lost mass mul- 
tiplied by the relative velocity of the ejected fuel. 

Taking the velocity of the outflowing gas with respect to the 
rocket to be a constant value, the above equation can be easily in- 
tegrated. If the mass of the rocket was M, when the velocity of 
the rocket was v,, and became equal to M when the velocity of 
the rocket changed to v, integration yields 


M. 
v—v,=cln a. 


The latter formula was initially obtained by K. E. Tsiolkovsky, 
the first to design a rocket and do research in the theory of inter- 
planetary travel. 

Going over tocommon logarithms and introducing ‘the designa- 
tion m= _M,—M for the dillerence in the mass of the rocket, i.e., 
for the mass of the ejected fuel, we obtain Tsiolkovsky’s formula 
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in the form 


m 
v=CX2.3 x log (1 +i) 
(the initial velocity v, is assumed to be equal to zero). 
In modern rockets, the velocity of gas outflow is probably not 
less than 2,000 m/sec. Using this velocity in the formula, the follow- 
ing table of values is obtained: 


i 0.25 #410 40 # 10.0 32.3 54 999 
v(mjsec) 446 1,386 3,218 4,817 7,013 8,000 13,815 


As can be seen from this table, the rocket velocity increases 
much slower with respect to the amount of ejected fuel than one 
would like. To give the rocket a large velocity, a tremendous 
amount of fuel, relative to the initial mass of the rocket, must be 


ejected. Thus, if a velocity of 7 km/sec is imparted, less than 


a of the initial rocket mass will remain. 


A velocity of about Il km/sec must be imparted to a rocket if 
it is to escape from the Earth’s gravitational pull. This figure is 
obtained in the following simple manner. To escape from the 
Earth, a rocket must possess sufficient kinetic energy to perform 
the work of moving a body from the Earth’s surface to infinity. 
But this work against the force of gravity is equal to the differ- 
ence between the rocket’s potential energy at the Earth’s surface and’ 
at infinity. Since at infinity the potential energy is equal to zero, 
ES condition for escape from the Earth has the following simple 
orm: 

me? mM 

“2 SR? 
where M and R are the Earth’s mass and radius, respectively. 
Multiplying the numerator and the denominator of the right-hand 


member of the equation by R, then substituting vB by g, the 


acceleration of the gravitational force at the Earth’s surface, and 
cancelling the rocket’s mass, we obtain the condition for escape 
from the Earth: v= V2gR, which yields a figure of about Il km/sec. 

If we assume that the velocity of the gas outflow is 2,000m/sec, 


the ratio ai can be obtained from Tsiolkovsky’s formula. It is 
equal to 244. For the rocket to escape from the Earth, its design 
must be such that only oa of the rocket’s mass before take-off 


will remain in its interplanetary flight. The problem is an excep- 
tionally difficult one. Basic advances in the field of astronautics 
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will be achieved, therefore, when methods of producing rocket 
engines with considerably larger velocities of fuel ejection are 
found. If we were to succeed in increasing this velocity by a 
factor of three, i.e., increasing this velocity to 6 km/sec, the ratio 


m 
aM would fall to 5.3. 


It is easier to put an Earth satellite into orbit because a small- 
er initial velocity is required. If we assume that the acceleration 
of the gravitational force at the heights at which we desire the 
satellite to orbit is approximately the same as at the Earth’s 
surface, then the law of mechanics for artificial planets will have 
the form mg = ma, and since the satellite moves in a circle, the cen- 


tripetal force is a=%. Thus, the velocity of the rotating satel- 


lite is v=VgR, i.e, 8 km/sec. When such a velocity is impart- 
ed to a rocket, it -is transformed into an Earth satellite.- From 


the above table, we see that the value of 5 required for imparting 
a velocity of 8 km/sec to a rocket is 54. 


Example of motion of a body with variable mass. Consider a water droplet 
falling in an atmosphere saturated with water vapour. At the instant of time ¢, 
the droplet has a mass m and a radius r. During the time dt, the volume of 
the droplet, and hence the mass (for a density equal to 1), increases by 4ar?dr. 


Thus, the rate of increase in mass is 4a S. At the same time, it is clear 


dm the rate of condensation of the water 


at’ 
vapour, must be proportional to 4mr?, the condensation surface. Hence 


and r=&t, where & is some constant of proportionality. 

Let us derive the equation of motion of this droplet in the Earth’s gravita- 
tional field. We are interested in the change of momentum d (mv), which accord- 
ing to the fundamental law of mechanics is equal to Fdt, where F=mg. Thus: 


from physical considerations that 


ers nst 
> GET OO 


dad dv dm 
F= = (mo), i.e., MSM E+ Fe 


Substituting the expressions for m and r, we obtain 
dv 30 


a8 T° 
By integrating this equation, we arrive at the following result: v= £4, i.e, the 


droplet falls with the constant acceleration +g = 245 cm/sec, The resistagce 
of the air was not taken into consideration. ; 


CHAPTER IV 


ROTATION OF A RIGID BODY 


18. Kinetic Energy of Rotation 


In this chapter, we shall be concerned with “perfectly rigid” 
bodies. This means that we may neglect any deformation occur- 
ring during the motion of such a body, and assume that’ the dis- 
tances between the particles of the body remain unchanged. 

Let us consider a rigid body rotating about a fixed axis passing 
through it (Fig. 25). We can conceive of the body as consisting 
of small volumes of masses Am,, Am,, ... at distances r,, r,,... 
from the axis of rotation. Corresponding to 
the various values of the distances are the 
various velocities of motion v,, v,, .... We 
are interested in the kinetic energy of rota- 
tion of the entire rigid body, which is com- 
posed of the kinetic energies of the individual 
particles Am,, Am,, ..., i-€., 

Amv? — Amv; Am,v§ 


gg Pp ag he 


The velocity of angular motion of any 
point of the body can be easily expressed in 
terms of w, the angular velocity of the rotat- 
Fig. 25 ing body. If the body turns through an angle 
dp in the time dt, the derivative ge 


‘dt 
called the angular velocity: 











is 
— 4 
=a: 


For the case of uniform motion, the above formula is transformed 
into a relation already known to the reader, namely, o=e, The 
quantity @ is usually measured in radians per second. If the body 
performs | revolution per second, its angular velocity is equal to 
2n rads/sec. 

Different points of a rotating rigid body have different veloci- 
ties v (called linear velocities), but the angular velocity @ is the 
same for each point. In turning through an angle dg, a point de- 
scribes an arc ds=rdqg. Dividing both members of this equation 
by the time of motion dt, we obtain a relationship between 
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linear and angular velocity: 
v= OF. 


Thus, the formula already known for uniform motion is valid in 
the general case. 

Using this relation, the expression for K,,, may be written in 
the following form: 


K rot = s (r:Am, + r2Am, +...). 


The quantity in brackets does not depend on the velocity of 
motion, but is a measure of the inertial properties of a body exe- 
cuting rotational motion. The greater the value of the expression 
in brackets, the greater the energy that must be expended to 
achieve a given.velocity. Therefore, the quantity 


T=rjAm,+rAm,+... 


is called the moment of inertia of the body, and the term r?Am is 
the moment of inertia of a particle. The quantity / may be ex- 
pressed more briefly as follows: 


I=\rdm, 


where the integration (summation) encompasses all the particles of 
the body. : 

The formula for a body’s kinetic energy of rotation acquires the 
form 


Lo? 
K rot ae Wee 


This formula is valid for a body rotating about a fixed axis. 
For a rolling body (a ball, wheel, etc.), the energy of motion will 
consist of the energy of rotational and translational motion. Thus, 
if a rolling body has a mass M, moment of inertia /, translational 
velocity v and rotational velocity w, the kinetic energy is 


Mv? , Iw? 
Krou=—y tz: 


Moreover, it turns out that this formula is valid for any arbitrary 
motion of a rigid body. In theoretical mechanics, it is shown that: 
any: arbitrary motion can always be resolved into translational and 
rotational: motion. The rotation, in this case, must be considered 
with respect to an axis passing through the centre of mass. 
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19. Moment of Inertia 


If we carefully examine the formula for moment of inertia, we 
see that the value of / depends on the nature of the distribution 
of the mass with respect to the axis of ro- 
tation. The particles that are far from the 
axis of rotation contribute considerably 
more to the total value than those that are 

close to it. 

Let us calculate the moment of inertia 
of a flat disk, of radius 7, relative to the 
axis perpendicular to the plane of the disk 
and passing through its centre (Fig. 26). 
The mass of an annular element of radius 
x is dm=oxX2nxdx, where @ is the den- 

Fig. 26 sity of the disk’s material. This ring has 
a moment of inertia d/,-=dmx’, and the 
moment of inertia of the entire disk is: 


r 
mr 


r 
1,=) dl, = | ox2axxtde=2n9 P=". 
0 


0 


It is evident that with respect to the same axis the moment of 
inertia of a ring is [,=mr’, i.e., /,=2/,, when the entire mass 
is concentrated at the outer circumference. 

The moment of inertia of a body will vary in accordance with 
the location of the axis of rotation. If a thin rod rotates about 
its long axis, the moment of inertia will be very small, for all the 
particles lie very close to the axis of rotation and, therefore, all 
the quantities rj, r:, ... entering in the formula for / will have 
very small values. The moment of inertia will be much larger if 
the rod is rotated about a line perpendicular 
to its axis. 

The moment of inertia depends on the orien- 
tation of the axis and the location of the point 
through which it passes. If no specific stipula- 
tion to the contrary is made, it is assumed that 
the axis of rotation passes through the body’s 
centre of mass. 

If the axis of rotation is displaced relative 
to the centre of mass by the amount a (Fig. 
27), I, the new moment of inertia, will differ 
from/,, the moment of inertia with respect to the 
parallel axis passing through the centre of mass. 
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In view of what was stated at the end of the previous article, 
we can express the kinetic energy of a body rotating about the 
displaced axis as the sum 


K=Me 418, 


where v is the velocity of motion of the centre of mass and is 
equal to aw. Thus, 
R= 


Me wo? 


+4 =F (1, + Ma). 





Hence, the moment of inertia / with respect to a parallel axis, 
displaced by a distance a from the centre of mass, can be ex- 
pressed as follows: 

I=1,+Ma’. 


It follows that the moment of inertia with respect to an axis 
passing through the centre of mass is always the smallest possible 
for a given orientation. Depending on its symmetry, a body will 
have one, two or three moments of inertia with respect to the 
main axes passing through the centre of mass. 

Thus, a disk is characterised by two axes passing through its 
centre—one lying in the plane of the disk and the other os 
dicular to the disk. The moments of inertia are then and 
respectively (it is assumed, naturally, that the distribution of Pa 
throughout the disk is elon For a ring, the moment of iner- 


tia about similar axes is > and mr*, respectively. 


For all solids of revolution, it is sufficient to know the moments 
of inertia with respect to two axes. In the case of a body of ar- 
bitrary form, to completely describe the inertial properties of the 
body during rotation, it suffices to know three moments of inertia 
with respect to axes passing through the centre of mass, namely, 
Tnax—the largest moment of inertia, J/,;,--the smallest, and 

mean —the moment of inertia with respect to an axis perpendicu- 
lar to the first two. 

The only body for which the moment of inertia about all the 


: : 2 
axes is the same is a sphere. For a sphere, l= =m’, 


The above formulas for moment of inertia are calculated from 
the relation: 


[= y rdm. 
To use this formula, it is generally necessary to be able to oper- 


ate with multiple integrals. Examples of such calculations are 
given in courses on theoretical mechanics. 
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As we shall see below, physicists are sometimes interested in 
the values of moments of inertia for molecules. Since the mass of 
atoms is concentrated in nuclei whose dimensions are very small, 
the calculation of the moments of inertia can be accomplished 
without difficulty, for the atoms may be considered as_ point 
masses. 

For a diatomic molecule, the moment of inertia with respect to 
the axis passing through the atoms is equal to zero. For the axis 
_perpendicular to the line joining the atoms, we obtain 


f= mgr, + mar, 


where r, andr, are the distances of atoms A and B of a dia- 
tomic molecule to the centre of mass. If 7 is the distance between 


r m 
atoms, r,-+rg=/ and-1—=#. Therefore, 
Tra Mma 


l= MaMR 72 
=—__ + /?, 
mat+mp 


The moments of inertia of more complex molecules may also be 
calculated as the sum of the moments of inertia of the atoms con- 
sidered as point masses. 

Examples. 1. The flywheel of a ship’s engine has a mass of about | ton, 


a diameter of 2 metres and, therefore, a moment of inertia /~1,000 kgm. 
Making 300 rpm, the flywheel possesses a kinetic energy of rotation 


K =" = 500,000 joules = 50,000 kg-m. 


2. The moment of inertia of the Earth is about 10% gm cm?= 105? kgm? 
The kinetic energy of rotation of the Earth about its axis is 2.5107" joules. 
3. In a molecule of hydrogen H,, the distance 1=0.753x107* cm, the mass 
of the hydrogen atom my=1.6598x107~** gm and, therefore, the moment of 
inertia of the molecule with respect to the axis perpendicular to / is 
mH py? 


[=—y—= 0.46 x 107% gm cm’. 


20. Rotational Work and the Fundamental Equation 
of Rotation 


If a body fixed on a shaft is made to rotate by a force F or, 
on the other hand, if a rotating body is braked by the force F, 
the kinetic energy of rotation increases or decreases by the mag- 
nitude of the expended work. Just as in the case of translational! 
motion, this work depends on the effective forces and the displace- 
ment produced thereby. However, the displacement is now angu- 
lar, and the expression that we know for the displacement of a 
particle by a certain distance is not applicable here. 
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To find the formula that we are interested in, let us refer to 
Fig. 28. The force F is applied at a point located at a distancer 
from the axis of rotation. The angle between the direction of the 
force and the radius vector is 
designated by §. Since the body 
is perfectly rigid, the work of 
this force {even though applied 
at one point) is equal to the 
work expended in rotating the 
entire body. In rotating the body 
through an angle dg, the point 
of application traverses the path 
r dp and the work dA, equal to 
the product of the projected 
force along the direction of displacement and the magnitude of the 
displacement, is then 





dA= Fr sin6 dg. 


Frsin§ is known as the moment of force or torque: M= Frsin0. 
From the diagram, it is seen that rsinO=d, where d is the short- 
est distance between the line of action of the force and the axis. 
of rotation. Hence, 

M = Fad, 


i.e., the torque is equal to the product of the force and the lev- 
er arm. 
The formula for work that we have sought is 


dA = Mag. 


The work of rotating a body is equal to the product of the effec- 
tive torque and the angle of rotation. 

Strictly speaking, the formula is only valid for an infinitely small 
angle dp. However, we may use it in any case if we understand 
M to mean the average value of the torque for the time of rota- 
tion. Then, 

AA=M,,Agq. 


The work of rotation goes to increase the kinetic energy of rota- 
tion. Hence, the following equation must hold: ¢ 


Mdp—d (>). 


If the moment of inertia is constant for the time of motion, then 
Mdp = Iwdw 
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; dy 
or, since O=7, 


dw 


M=I!I-. 


This is the fundamental equation of motion for a rotating body. 


The torque acting on a body is equal to the product of the mo- 
ment of inertia and the angular acceleration . 


Examples. 1. The torque on a wheel of a locomotive developing a traction 
of about 10,000 kg is about 300 kg-m= 3,000 newtonx metres. 

- A man riding a bicycle produces a torque of 
about 100 newtonxmetres on the pedals. 

2. By means of an example, we shall show the 
connection between the expression for the kinetic 
energy of a moving rigid body (see p. 179) and the 
fundamental law of mechanics. 

Let us consider a spool of mass m and radius 
r, possessing a moment of inertia J with respect 
to its axis and wound with weightless thread (Fig. 
29). The free end of the thread is fastened at a 
certain height above the Earth’s surface. The spool 
is allowed to fall under the action of its own 
weight mg. Hence, the equations of motion for 
the spool are: 





do 
and 
dy 
Trl ’ 


where T is the tension of the thread and is the 
angular velocity of rotation of the spool. Eliminat- 





mg | ing T, we obtain for the acceleration: 
29 a= =—* 
Fig. ; at pee 
g 44, 


If time is counted from the moment the spool begins to fall, then in ¢ seconds 
the spool will fall a distance h=e. It is evident that the total kinetic 


energy of the spool at that instant is equal to the change in the potential 
energy of the spool: 


K=mgh=mg =. 


Substituting the expression for a, we obtain 


mo? , ly? 
Reg. Te 
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21. Angular Momentum 


The similarity between the formulas of motion of a particle and 
the derived laws for the rotation of a rigid body is immediately 
evident. Thus, compare the following formulas: 


Particle Rotating body 
F=m% Male 
K= “ Kea ~ 


Clearly, the physical eects are also an While in the 
mechanics of particles the acceleration is determined by the force, 
in rotational motion the angular acceleration is determined by the 
moment of force, i.e, the torque. The role of mass is played by 
the moment of inertia, which in rotation is the measure of a 
body’s inertia (the mass alone is insufficient here for this purpose). 
This similarity encourages us to go a step further and assume that 
analogous physical quantities are related by analogous relations. 

In the previous chapter, it was established that the momentum 
p=mv is a physical quantity satisfying the law of conservation in 
a closed system. The quantity analogous to p is the moment of 
momentum (angular momentum): 


N=Io. 


It can be rigorously proved that angular momentum satisfies the 
law of conservation, i.e., in a closed system, the total angular 
momentum of the bodies belonging to this system does not change. 
An increase in the angular momentum of one of the bodies is com- 
pensated for by an equivalent decrease in the others. 

The relation 


o,+/,0,+/,@, -+...= const 


has many interesting applications that are in many ways analogous 
to the problems studied in the previous chapter. 

The law of conservation of momentum when applied to a single 
body has the form mv-==const and is, therefore, identical with the 
law of inertia. Even in this simple case, the law of conservation 
of angular momentum leads to an interesting result. A single 
body, in the absence of interaction with its medium, Push: satisly 
the condition 

Jo = const. 


However, the moment of inertia of a body may change during 
motion. It is, therefore, evident that an increase in J must be 
accompanied by a decrease in w, and vice versa. 
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One can cite numerous examples, and this phenomenon can be 
strikingly demonstrated by means of a swivel stool. Holding a pair 
of dumb-bells in your hands, be seated on such a stool (Fig. 30). 
Place your arms outstretched in a horizontal position and have 
someone give you asmall rotatory push. Motion takes place forthe par- 
ticular moment of inertia / at an angular velocity w. Now, fold your 
arms on your chest. As a result the moment of inertia drops 
sharply to /’. Since the product /w remains unchanged, /w=/'w’. 





Thus, changing the position of one’s arms leads to a considerable 
increase in the velocity of rotation. The process may be repeat- 
ed—stretching one’s arms out leads to retarded motion and fold- 
ing them produces accelerated motion. : . 

Decreasing the moment of inertia as a method of increasing the 
velocity of rotation is quite familiar to gymnasts and dancers. It 
is used in all kinds of jumps, tumbles and spins. Thus, a ballet 
dancer in a position of large moment of inertia will impart veloc- 
ity by changing her posture to a position of small moment of 
inertia (Fig. 31). 

Rotational recoil is usually demonstrated by means of the afore- 
mentioned swivel stool and a wheel fixed on a long axle (Fig. 32). 
While standing on the stool and holding the wheel above one’s 
head, the wheel is twirled by means of a sudden movement. As a 
result, the stool rotates in the direction opposite to that of the 
wheel. This is precisely what is meant by recoil. /,o,, the angu- 
lar momentum of the wheel, is balanced by /,@,, the angular mo- 
mentum of the stool with the person standing on it (the angular 
momenta have opposite signs). This is due to the fact that in the 
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Fig. 31 


81 


initial state both the stool and the wheel did not rotate, and the 


total angular momentum was equal to zero. 


An inelastic impact was defined above as an encounter between 
two bodies as a result of which the bodies move together. Some- 
thing analogous may be demonstrated in the case of rotation using 


the equipment just described. The wheel is 
made to rotate and is then transferred to a 
person standing on the stool. Thus, the ini- 
tial state is the following: the stool and the 
person standing on it are at rest, while the 
bicycle wheel is rotating with a momentum 
I,o,. Now, the person on the stool takes hold 
of the wheel. The angular momentum /,o, 
cannot disappear, but it now belongs to the 
entire system. Naturally, the person on the 
stool and the wheel rotate together in the 
same direction as the wheel was rotating. 
Clearly, J,o,=(/,+/,). If before “unifica- 
tion” the person rotated with a velocity o,, 
the angular momentum to be conserved is 
1,0,-+/,o,. Therefore, i 
oO, +/,0 
1,0, +1,0,=([,+1,)0 or o= 2352", 

This is very similar in form and in content 
to the expression for inelastic impact. 
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Examples. 1, The flywheel of a ship’s engine has a moment of inertia of 
1,000 kg m? and at 300 rpm its angular momentum is ~ 30,000 sme : 

2. A billiard ball whose radius is 2.5 cm has a moment of inertia ] = 250 gm cm? 
and moves with a velocity of 5 m/sec without skimming the table. Its angular 
momentum is then ~ 50,000 gmcm?/sec=5x 1075 kg m?/sec. 

The angular momentum of the Earth in rotating about its axis 
is~10*4 kg m?/sec, 


22. Free Axes of Rotation 


Let us assume that a body has received angular momentum 
about some axis to which the body is fastened. Further, let us 
assume that the fastening is then removed. While the angular mo- 
mentum must be conserved (naturally, neglecting friction), the ori- 
entation of the body in space may change. If this occurs, and as 
a result there is a change in the moment of inertia, it will be 
compensated for by a corresponding change in the angular velocity. 

However, in a number of cases the nature of the rotation does 
not change. Stable rotation takes place about the original axis, 
just as if the axis of rotation were fixed as before. Theory and experi- 
ments show that there are two axes passing through the centre of mass 
that may be permanent, free axes of rotation, namely, the axis of 
aasiouen moment of inertia and the axis of minimum moment of 
inertia. 

If the fixed axis of rotation passes through the centre of mass 
(Fig. 33), but is inclined to the axes of symmetry and, therefore, 
to the afore-mentioned 
orientations, then after 
the fastening is removed, 
the body begins to 
, change its orientation 

with respect to the axis of 
rotation. It can be seen 
from the figure that the 
reason for the change of 
orientation is the fact 
that the centrifugal 

Fig. 33 forces form a couple of 

forces. The body will 

continue changing its orientation until the axis of rotation becomes 
a free axis. 

It can be shown in a number of ways that a freely rotating 
body will keep changing its axis of rotation until the rotation 
occurs about a free axis. Tying bodies of various shapes to one end of 
a string, and attaching the other end to the shaft of a rapidly 
rotating motor, we can transmit rotary motion to a body without 








Sec. 22] Free Axes of Rotation 83 





having a fixed axis of rotation. In Fig 34, the successive orien- 
tations of a rotating hoop, chain and match box are shown. The 
match box will begin rotating about its shortest or about its long- 
est edge. Theory shows that rotation about the axis having a 
mean moment of inertia will not be stable even if this axis is an 
axis of symmetry. 





Fig. 34 


In constructing one of the first turbines, attempts fo fix the position 
of the shaft with sufficient accuracy to eliminate the couple of centri- 
fugal forces acting on the bearings at a velocity of 30,000 rev/min were 
unsuccessful. At such high velocities, these forces are intolerably great. The 
problem was solved by using a flexible shaft for the turbine wheel. The rota- 
tion took place about the free axis and the flexible shaft adapted itself to this 
axis. 

Let us consider this phenomenon in somewhat more detail. We shall desig- 
nate the shift in the centre of gravity of the turbine wheel due to the wheel’s 
asymmetry by a and the amount by which the shaft sags under the action of 
the centrifugal force by A. The shaft sags in the direction of the asymmetry. 
Hence, the expression for the centrifugal force may be written in the form 
4n7n?M (a-+-A). This force is balanced by the elastic force RA, where & is the 
stiffness of the shaft. Thus, : 


1 
A=a 7 


tne 


The formula shows that when the number of revolutions per minute n is large 
the shaft’s sag A does not increase, but tends to become equal to minus %, the 
measure of the wheel’s asymmetry. This means that when the angular velocity 
of the turbine increases the total displacement of the wheel with the shaft from 
the axis of rotation tends to become equal to zero. Herein lies the adaptabil- 
ity of the flexible shaft: It can bend, without breaking, by the amount re- 
quired to eliminate the centrifugal force. 

From the above formula, it follows that the condition k/4%?n?M—1 is crit- 
ical, for the- relation shows that the shaft’s sag becomes infinitely large. This 


4* 
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is the instant of resonance which must be rapidly passed in running the turbine 


(the external frequency n=pV dt. i.e., n coincides with the natural fre- 


quency of a turbine wheel of mass M placed on a shaft of stiffness k; see 
Chapter V). 


23. The Gyroscope 


The term gyroscope usually denotes a device that can rotate about 
any orientation of its axis. If a gyroscope is rotated and then not 
interfered with, its axis of rotation will remain unchanged as long 
as no forces act on it (Jw, in this case, should not change). 

The action of a force on a gyroscope’s axis of rotation manifests 
itself in a somewhat surprising manner. This may be demonstrated 
using a gyroscope that is balanced by a load in such a manner 
that the axis of the device is horizontal (Fig. 35). The gyroscope 
is rotated in the vertical plane and a load G is placed on the 


ons, nN ees 
orn v Mee ate 


Fig. 36 





horizontal bar. It would seem that the entire right-hand portion, 
i.e., the gyroscope, should move upwards. Indeed, this would be 
the case if the gyroscope were not rotating. Actually, the rotating 
gyroscope begins to move with constant velocity about the vertical 
axis as shown by the dotted line and the arrow. This motion is 
at a right angle to the direction of the applied force. 

This phenomenon, revolving of an axis of rotation about the 
direction of an applied force, is called precession. Everyone is 
familiar with the precessional motion of a top. As soon as the axis 
of the top begins to deviate in the least from the vertical, a grav- 
itational torque begins acting on the top, tending to topple it. 
A stationary top would fall, but a rotating top begins to precess 
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about the vertical. The axis of the top will then describe a cone 
whose vertex is at the point of support of the top. 

In general, the rotation of a top is even more complex, for nu- 
tations are superimposed on the precessional motion. These nutations 
are due to small jolts (which are always present) that make the 
top shake (Fig. 36). As a result of the nutation effect, the axis 
describes a cycloidal curve, as shown in the figure, instead of a 
circle. It should be noted, however, that nutational effects are usu- 
ally very weak. 


CHAPTER V 


VIBRATIONS 
24. Small Deviations from Equilibrium 


The motion of a body or particle about an equilibrium position 
is often encountered in nature. Thus, a small load on a string 
oscillates back and forth, a spring quivers, and an atom in a crys- 
tal lattice vibrates. 

If the body or particle on which forces are acting is in a position 
of equilibrium, its potential energy is a minimum and the system is in 

. a potential well (Fig. 37). When the 
deviation from the equilibrium posi- 
tion is not large, we are concerned 
with only a small portion of the po- 
tential well. A potential curve near 
the equilibrium position can always 
be approximated by a parabola, 
i.e., it can be written in the form 


=> kx*. Here, + k is the con- 


3 stant of proportionality. The factor 
5 has been introduced for conve- 
Fig. 37 nience and its purpose will present- 

ly become clear. 

The reasoning used in arriving at the above relation is the follow- 
ing: Potential energy is a function of the displacement from the 
equilibrium position. As is well known, making the ‘proper assump- 
tions, any function may be expanded in a Taylor series for small 
values of x. The exponent of x increases consecutively from term 
to term: 





U=ax+4 kx" + bx* +ex*+... 


However, for small x, the terms of higher power may be neglect- 
ed and, if the potential well is symmetrical, the first term van- 
ishes, for the potential energies at equal distances to the left and 
right of equilibrium are equal. 

The force acting at a point deviating from the equilibrium posi- 
tion is equal to minus the derivative of the potential energy. Thus, 


if the energy is expressed by the formula U =F hx’, then F=— kx. 


Sec. 24] Small Deviations from Equilibrium 87 


The meaning of the negative sign is clear, namely, the force in 
question always restores the body to the equilibrium position and 
is always directed oppositely to the displacement. Consequently, 
the force F = — kx is called the restoring force and the coefficient k 
is sometimes called the restoring force constant. 

What is the nature of the motion under the action of the restor- 
ing force? Newton’s law, which is written in the form ma=— kx 
for motion near equilibrium, should give us the answer to this 
question. 

This equation is satisfied if the point undergoes harmonic vibra- 
tion about the equilibrium position, i.e., vibration in accordance 
with the relation 


2 
x= Acos> t, 


where T is the period of vibration. 
Let us verify this statement. The velocity of motion of the point 
for the indicated dependence between displacement and time is 


It should be noted that the maximum value for the velocity of 
the vibrational motion, i.e, the amplitude of the velocity, is 


Unaz= oe. Let us now determine the acceleration by taking the 


derivative of the velocity. We obtain 


4n? Qn 
a=— ar A cos Fr t. 


Substituting the expressions for acceleration and displacement in 
Newton’s law, ma=—kx, we obtain 


4x? Qn 2x 
—mazA cos =~ t = — kA cos t. 


We see that the factors depending on time cancel out. Hence, the 
equation for harmonic vibrations satisfies Newton’s law for small 
deviations from equilibrium. 

It is noteworthy that Newton’s law places a constraint on the period 
of the vibrations. As can be seen from the last formula, the period of the 


‘free vibrations about the equilibrium position is T=2z +. 


This period is determined by the vibrating system—the restoring 
force constant k and the mass of the particle. It is, therefore, 
understandable that this period is called the natural or character- 
istic period of the vibrating system. 
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No restrictions are placed on the amplitude A of the vibrations, 
with the exception, of course, that the deviations from the equi- 
librium position must be smadl. 


25. Particular Cases of Vibrations 


In view of the fact that we deal with two types of potential 
energy in mechanics, namely, elastic and gravitational, it also be- 
comes possible to divide mechanical vibrations into these two cases. 

Bodies vibrating under the action of an elastic force usually 
perform linear vibrations of compression and extension. However, 
torsional vibrations are also encountered. 

If a body suspended from an elastic band, spring or wire is 
displaced from the equilibrium position along the band, spring or 
wire axis, linear vibrations arise under the action of the elastic 
restoring force. The coefficient & is, in this case, the stiffness of 
the vibrating body. 

To what extent this coefficient determines the resulting period and frequency 
of vibration is seen from the following example. Identical loads, whose masses 
are equal to | kg, are suspended from three springs having different stiffnesses. 
Under the action of these loads, the springs are elongated by 1 mm, 1 cm and 
1 metre, respectively. The coefficients of stiffness will then have the following 
values: 


fe, = S81 10" 0.981 x10" BBS; k= 0,981 x 108 YES ; 
k= 0.981 108 S225 


. 


The periods and frequencies of the vibrations are: 


T= 20 yf Bom cage = 684x107 sec, v,= 15.8 eps; 


=0.2 sec, ar CPs; 
ees v,=0.5 cps. 


For torsional vibrations, the restoration to equilibrium takes 
place under the action of a torsional moment that is directly pro- 
portional to the angular displacement for small deviations from 
equilibrium. If, for example, a massive disk having a moment of 
inertia / is suspended from a wire, and the wire is twisted by 
some angle or other, the equation for the torsional vibrations of 
the disk will be [2=—Dy. The torque D, relative to unit angu- 
lar displacement, corresponds to the restoring force constant, and 
the moment of inertia corresponds to the mass. Thus, the period 
of free torsional vibrations is represented by the formula 


Tam Vt. 
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The greater the moment of inertia, the lower the frequency of the 
vibrations. 


Example. Assume that a disk having a mass of 100 gm and a radius of 5 cm 
is suspended from a steel wire and that the period of the torsional vibrations 


is | second. The moment of inertia of the disk is 1, = 72 = 1,250 gm-cm?, 


Thus, the restoring force constant D=454 — 49,400 dune ce If a disk of the 
same mass but of 1 cm radius is suspended from the same wire, the period of 


the torsional vibrations will no longer be 1 sec, for 7,=2x 0.2 sec. 


A body oscillating under the action of gravitational force con- 
stitutes a pendulum. If the pendulum may be approximately repre- 
sented as a point mass suspended from a weightless wire, we call 
it a mathematical pendulum (Fig. 38). 

From the figure, it is easily seen that the expression for the 
restoring force is mgsina, i.e., the component of the weight 
along the tangent to the path. If the devia- 
tion from equilibrium is small, the sine of - 
the angle may be replaced by the value of 
the angle a or by the quotient obtained when 
the displacement x is divided by the wire 
length /. In this approximation, displacement 
along the chord is assumed to coincide with 
displacement along the arc. Thus, the restor- 


ing force is equal to mg + and the restoring 


force constant is equal to as. In the expres- 
sion for the period, the mass of the bob can- 
cels out and T=2n V/ +. 

The fact that the period of a pendulum does 
not depend on the mass is an example of a 
common feature of particle motion in a grav- 
itational field. Since according to the law 
of gravitation the force acting on such a 
particle is proportional to the mass, the mass m 
cancels out in the equation of motion. Thus, we 9 
have arrived at the well-known result that, Fig. 38 
for a given location in a gravitational field, 
the period of a mathematical pendulum depends only on its length. 

The measurement of the period of a pendulum may be used to 
determine g. The value of this measurement may be determined 
extremely accurately so that very minute variations in the value 
of g may be ascertained. Various methods of determining the 
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Earth’s shape and various gravimetric investigations are based on 
this measurement. (Small changes in the value of g, which, how- 
ever, greatly exceed the limits of experimental error, may occur 
due to seams of various density below the Earth’s surface.) 

When the small oscillations of a physical body cannot be approx- 
imated by a point mass, the pendulum is called a physical pen- 
dulum, Fig. 39 shows a rigid body whose axis 
of rotation (oscillation) passes through it. The 
period of the physical pendulum is calculated 
by the same formula as for torsional vibrations: 


T =2n 4, 


since the equation 


do 


is valid for the motion of any body rotating 
about an axis. However, in the case of the grav- 
as itational field; we can easily express the torque 
Fig. 89 relative to unit angular displacement by a 
more direct pendulum characteristic. From the 
same figure, it can be seen that the torque is equal to mgrsina, 
i.e., the product of the weight of the body, the distance r from 
the centre of gravity to the point of suspension, and the sine of the 
angle of deviation from the equilibrium position. Since the deviation 
from the equilibrium position is assumed to be small—as always 
in this section—we obtain the expression mgra for the torque; 


whence, D= "828 = mgr. Thus, the period of a physical pendulum is 


given by 
V 1 ry 


The quantity vit is called the equivalent length of the phys- 


ical pendulum. This is the length that a mathematical pendulum 
would have for such a period. 





26. Transformation of Energy. Damped Vibrations 


If there is no friction, the total energy @ of a body naturally 
remains unchanged for vibrations about its equilibrium position. 
Since potential energy is usually expressed relative to an arbitrary 
level, we shall assume that the potential energy in the equilibrium 
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position (displacement x=0) is equal to zero. At any instant of 
motion, 


mou? , kx? 
ae a 


In the equilibrium position, the kinetic energy is a maximum. 
In the end positions, the body comes to a standstill (v =O and 
x= A) and the potential energy is a maximum. It is evident from 
this, incidentally, that 


G=3> 


i.e., the vibrational energy is proportional to the square of the 
amplitude. 
For the three springs considered in the example on p. 88, assuming the 


amplitudes of the oscillations are the same, i.e, A=O.1 cm, the total vibra- 
tional energy will have, respectively, the following values: 


&:=0.49X 10° ergs; g¢,=0.49X10* ergs; g;=49 ergs. 


This discussion has not taken into account the frictional force, 
which, as a rule, is experienced by all vibrating bodies. Such ideal 
vibrations will continue for ever without 
change in amplitude. Friction, however, 
produces damped vibrations. Formally, 
in this case too, it is possible to write 
the displacement equation in the form 


x=Acos aot, 


but A is understood to decrease with 
time (Fig. 40). To determine how A de- 
pends on time, the frictional force must 
be known, i.e., f,, must be known for 
every instant of time during which vi- Fig. 40 
brations occur. A simplifying assumption, 

more or less satisfied in practice, is that the frictional force is 
proportional to the velocity of motion: 


Fie oa Qu, 





where the coefficient a is known as the resistance constant. 


For a ball having a radius of 0.53 mm, the resistance constant a at about 
"15°C is 13.93 gm/sec in glycerine, 0.35 gm/sec in sulphuric acid and 0.01 gm/sec 
in water. 


The energy equation can now be written in the form 
dG =— avdx 
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and the vibrating particle continuously loses an amount of energy 
equal to the work of the resisting force. Hence, the equation of 
motion is written as follows: 


ma=— kx—av, 


By substitution, it is not difficult to show that this equation is 
satisfied by the equation x=Acoswt when the amplitude A de- 
creases exponentially with time: 


eabeay 
= 2 
A=Ae 2 , 


Here, A, is the amplitude at the instant of time ¢=0. 

It should be noted that the ratio of two successive amplitudes 
is a constant. Thus, the expressions for the amplitude after n—1 
and n periods, respectively, are 


(n nT 


—— (n—1)T os 
A,-.=A,e 2” and A,=A,e 2” 


Let us divide the former relation by the latter. The ratio 


Ans . 


a 
=e 2m 
n 


does not, in fact, depend on n. The rate of damping is sometimes 
expressed by the logarithmic decrement 6: 


A a 
fans 27; 
An 2m 


Thus, the damping is greater, the greater the resistance constant, 
the smaller the mass, and the greater the period. 

It should be noted that the period of damped vibrations differs 
from the ‘period of free vibrations. The same calculation that 
leads to the formula for the time dependence of the amplitude 
also yields the following relation for the period: 


en ee 
=. 

V) paige Las 

4mk 


This means that, for small resistance, T differs little from T, = 





T=T, 


==2n - When the resistance increases the period increases 
and, finally, for 


a? 
4mk / 
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vibrations cease. We say,-in this case, that the body displaced 
from the equilibrium position returns aperiodically to this position. 

Here are some approximate values for the logarithmic damping 
decrement of certain vibrating systems: 


Acoustical vibrating systems ..... 0.1 
Electrical oscillation circuits .... 0.02-0.05 
Tuning fork ae 10-5 
Quariz crystal sees 1074-1075 


Let us consider several examples of damped vibrations. 
a) Vibrating tuning fork, Logarithmic decrement é=— T=107* Assume that 


the period of the vibrating tuning fork is T=0.01 sec. Then, —=0.1 sec™!: 


This means that during the time 7 = 10 sec the amplitude of the vibrations 


decreases by the factor e: 
a 


nd 
A;=Aje *” ; Ape = Aye}. 


The quantity at is called the time constant of the given vibrating system. 


b) In acoustical vibrating systems, as can be seen from the above table, 
the logarithmic damping decrement is large. This means that the vibrations are 


rapidly damped. If 6=— 7=0.1, the amplitude of the tenth vibration, Ajo, 
will already be less than the initial amplitude A) by the factor e. Thus, 
Ay Ay A ee a 


ee EA Se , 1.0., ~~ =e. 
A, A, A, Arg At 


c) The change in the period of damped vibrations may be conveniently 
illustrated by means of a spring. Let a load having a mass m=50 gm be 
suspended from a steel spring, which is thereby elongated by 2cm. Thus, the 
stiffness of the spring is k=24,500 dynes/cm. If there were no damping, 


Ty=20y/ 4 = 0.28 sec, 


Assume that the damping is such that the time constant t,= “7 =5sec, i.e., 
- the resistance constant a=20 gm/sec. The period of the vibrations then becomes 
FG n ony 
Vote 
Let us now immerse this pendulum in liquid. Assuming the time constant in 


this case to be t,==1 sec, the amplitude of the fourth vibration will already be 
fe of the initial amplitude, i.e., there is considerable damping: 


T,T, (1+102 X 1075) ~ 1.001 Ty. 


Thus, even in this case the period increases by only 0.1 per cent. 


T= = T, (1+4.08 107%). 
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27. Forced Vibrations 


If a body is displaced from its equilibrium position and then 
not interfered with, the vibrations occur at the natural frequency 
of the body, independent of the nature of the excitation, i.e., the 
vibrations are determined only by the properties of the system. 
The frequency of the vibrations of a string remain the same re- 
gardless whether the sound was made by the string being plucked 
or struck. 

At the same time, a number of means exist of “locking” the 
vibrations of a body to an external frequency. Such forced vibra- 
tions may take place if two bodies capable of vibrating are coupled. 
One of the bodies will force the other to vibrate. A motor that is improp- 
erly balanced will execute vibrations that are transmitted to the foun- 
dation, i.e., the foundation will execute forced vibrations. We can 
perform the following experiment: A pocket watch is placed in a small 
box and suspended by three strings. As a result, the box passes 
into a state of forced vibration. In Fig. 41, a device is shown in 
which a rotating eccentric makes a pendulum pass into a state of 
forced vibration. In all these cases, a periodic force varying with 
some frequency acts on a body. Such a force is aptly called an 
external force. 

Forced vibrations do not set in immediately. A certain amount 
of time must elapse before the body coupled to the vibrating 
system begins to vibrate. Eventually, a particular amplitude is 
reached and the frequency of the vibrations will be exactly equal to o. 

The fact that a body has a natural frequency of vibration , 
nevertheless affects the phenomenon of forced vibrations. To be 
more exact, as we shall directly see, the natural frequency and 
the external frequency differ significantly. Fig. 42 shows the 


— cae 
&lé 


Fig. 41 Fig. 42 
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dependence of the amplitude of forced vibrations on the ratio = 


o 
for three systems having different amounts of friction. When the 
external frequency and the natural frequency coincide, the 
amplitude of the forced vibrations is a maximum. This phenom- 
enon is widely known as resonance. 

The curves shown in Fig. 42 may be determined theoretically. 
The equation of motion of a body executing forced vibrations, 
under the action of a periodic external force F,coswt, has the 
form 

ma= —kx—av-+F, cos wf. 


By substitution, one can easily show that the displacement of a 
vibrating point will satisfy the equation 

x= Acos (wt +8), 
where the amplitude 


Ae eet 


V m? (@? — w?)?-+- 020? 
and the phase shift B satisfies the equation 


fae Be eee 
B m (@,—o?) 
Taking into account that a= ** and v= %, let us substitute these values 


at? at? 
in the equation of motion. After simple conversion and grouping terms 
containing coswf and sinwt, we obtain 
{(— mw?-+-2)A cos B—awA sinb— Fy] cos ot — 
— [(— mw?--k)A sin B--aw A cos B] sin of =0. 

Since the obtained equation must be valid for every instant of time, the 
coefficients of coswt and sinwt must be equal to zero. Thus, we obtain two 
equations for determining A and f: 

[(— mw?--&) cos B—aw sin B] A= Fy 

[(— mw?-L&) sin B-+-aw cos B] A=0. 


Squaring both equations and adding, we obtain 


Ae of 


V (we —w?) +4 a2w? 
where o=Y 4 is the frequency of the natural vibrations. From the second 
equation, we obtain the phase shift B: 


tan p= —-— : 
m (Wy — 0?) 
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From the first formula it follows that the amplitude A depends 
on @ as follows: When w<, the amplitude increases as w increases; 
when w=, the amplitude reaches a maximum; and when o>o, 
the amplitude decreases as o increases. This effect (sharpness of 
resonance) is more pronounced, the smaller the resistance constant a. 
When there is little friction, the resonance disrupts the system, 
for at a=0 the resonance amplitude goes to infinity. Engineers 
must take this into account in their 
calculations. To design a structure so 
that it is insensitive to the vibrations 
of its foundation, a resonance curve 
similar to the one shown in Fig. 43 
must be available. The lower curve 
shows the vibrations of the founda- 
tion and the upper one, of the struc- 
ture. At resonance, which occurs when 
the period of the vibrations is 0.32 sec, 
the amplitudes reach a value of 20- 
25 microns. This, in general, is no 
small amount. 

The sharpness of resonance is an 
indicator of still another important 
phenomenon, namely, the sharper the 
resonance, the slower vibrations of 
constant amplitude set in. 

Another feature of forced vibrations 
is the presence of phase shift. Until 

Fig. 43 now, we have assumed that the origin 

of the coordinate system was so select- 

ed that with respect to¢=0 the maximum displacement is in the 
positive direction. Naturally, if we are considering only one vi- 
bration, there is no need to select any other origin. However, if 
we are comparing two vibrations and pick the origin so that 
x=A when ¢=0, then the displacement of the other vibration 
at this particular instant may have an arbitrary value. This 
circumstance may be taken into consideration by introducing the 
phase shift B in the argument of the cosine. Thus, if «= 
=A cos (wt +), then x= AcosB at the instant of time ¢=0. The 
phase displacement is uniquely described by means of the phase shift B. 

Let us now return to resonance phenomena. The quantity B in 
the formula for forced vibration indicates that the phase of the 
forced vibration, generally speaking, is shifted with respect to the 
phase of the impressed vibration. The magnitude of the phase 


shift depends on ~ , the ratio of the natural frequency to the 
0 


20h 


= 


Amplitude 





0.30 0.40 
Period (sec) 


Sec. 27] Forced Vibrations 97_ 





external frequency, and also on the damping. Fig. 44 shows that 
a 90°-phase shift occurs at the resonance frequency, independent 
of the damping. The effect of the damping becomes clear when 
the situation somewhat removed from the resonance condition is 
considered. For weak damping (small logarithmic decrement 4), 
at frequencies somewhat below resonance, the phase shift is almost 


-B 


SS] 


N/a 


Phase LAG 





Fig. 44 


zero, while at frequencies somewhat above resonance, the phase 
shift is almost 180°. The same tendency exists for heavy damping, 
but it is not so pronounced. For a small amount of friction, one 
can say that a 180°-phase shift occurs when the frequency passes 
through the resonance condition. 

A simple experiment (Fig. 45) will demonstrate the essence of 
these interesting relationships. Suspend a weight by a string and 
allow it to swing freely. When the period of the free vibrations 
of this pendulum is manifested, stop the pendulum and by periodic 
motion of the hand bring it into a state of forced vibration. 
At first move the hand rapidly, so thatthe period of the 
natural vibrations is greater than the period of the forced 
vibrations, then move it slowly, so that the period of the 
natural vibrations is less than the. period of the forced vibrations. 
It will be seen that in the first case the pendulum and the hand 
are 180° out of phase, while in the second case they are in phase. 


Let us again consider the spring on page 93, which has a damping 
constant a=20 gm/sec, a mass m==50gm and a period Ty=0.28 sec (@= 
==22.4sec™1), When an external sinusoidal force of frequency @=@, acts on 
the spring, the amplitude of the forced vibrations is equal to A=4cm when 
the amplitude of the impressed force is Fy=1,790 dynes + 1.8 gm. A deviation 
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Fig. 45 


of the frequency of the impressed force from w, results in a change of the 
amplitude of the forced vibrations and achange in the phase shift B between the 
vibrations of the spring and the external force. The table shows the data 
obtained for various deviations by means of the formulas derived in this section. 





Freq. of external force Amplitude of forced Phase angle 
w (cps) vibrations A (cm) —8 (degrees) 

2 3.58 0°05’ 

10 3.95 0°35’ 

15 4,48 0°15’ 

22.4 4.04 90° 
30 2,48 178°50’ 
40 1.31 179° 10’ 


It is seen that in the presence of damping the maximum amplitude of the 
forced vibrations is reached when the frequency of the impressed force is 
somewhat less than the natural frequency of the vibrations. The weaker the 
damping, the smaller this shift in frequency. 


28. Self-Sustained Vibrations 


Fig. 46a shows a trough of triangular cross-section fixed on a 
shaft about which it can rotate. The trough has some particular 
period of free oscillations, which may be observed by swinging the 
trough away from its equilibrium position. The oscillations will 
continue as long as friction and air resistance do not stop them. 
Let us place the trough under a water faucet and allow the stream 
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of water to flow evenly on the wall of the trough, at a point 
somewhat removed from the centre line. It is not difficult to 
envisage what will ensue. As more and more water pours into the 
trough, the height of the centre 
of gravity rises until, finally, it 
exceeds the height of the shaft to 
which the trough is fixed. The 
pressure of the stream of water 
is now sufficient to upset the 
trough; whereupon, water flows 
out and the trough returns to its 
original position. This cycle keeps 
repeating as long as the stream 
of water continues to flow. 
Thus, the trough will oscillate. ges 
However, the character of the a) 
oscillations produced in this man- 
ner is quite different from the 
oscillations considered above. 
In the first place, it is impor- 
tant to note that the external 





Amount of water 


force is not of an oscillatory na- lime —> 
ture; i.e., it is a constant force 

(the pressure of a stream of wa- b) 
ter). Secondly, such a system exe- Fig. 46 


cutes undamped oscillations, al- 

though subject to the action of friction and other resistance. And 
finally, the resulting oscillations are not harmonic, i.e., they do 
not have a sinusoidal shape. Thus, in our example, the similarity 
to a sinusoid is nil. By conducting such an experiment, it can be 
shown that the dependence of the amount of water in the trough 
on the time may be represented by a saw-toothed curve similar to 
that shown in Fig. 460. 

The oscillations described above may be classified as self-sustained 
oscillations. Such oscillations constitute a distinct phenomenon, 
basically differing from free, undamped oscillations occurring with- 
out the action of a force, as well as from forced oscillations occurring 
under the action of a periodic force. The above example may 
appear to be artificial. However, self-sustained systems have byoad 
application and are very often encountered wherever mechanical 
and other oscillations occur. 

A simple pendulum clock (Fig. 47) executes self-sustained 
oscillations. As is well known, such a clock is actuated by a falling 
weight suspended from a chain that passes over a gear wheel. 
This wheel is located .on the same axis as a balance wheel, which 
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can mesh with a symmetrical anchor escapement. A pendulum 
is rigidly fixed to the escapement. At the instants when the 
balance wheel, which is driven by the gear wheel via a gear drive, 
touches the pallets of the escapement with its teeth, the pendulum 
is given an impulse. The rest of 
the time the pendulum and _ the 
escapement swing freely, while the 
balance wheel moves by itself. The 
escapement and the balance wheel 
are so constructed that the pendu- 
lum obtains two impulses each time 
the balance wheel advances by one 
tooth. One impulse is obtained when 
the pendulum moves from left to 
right, and the other when it moves 
from right to left. 
Fig. 47 The self-sustained vibrations of a 
clock are basically similar to those 
of the trough of triangular cross-section. The vibrations occur 
under the action of a constant rather than a periodic force, are 
undamped in spite of the presence of friction, and are not harmonic. 

In the above examples, a common property of self-sustained 
vibrations is manifested, namely, the property known as feedback. 
A pendulum executes undamped oscillations and causes a mechanism 
to give an impulse at appropriate moments. The mechanism pushes 
the pendulum and the pendulum provides feedback to the mechanism. 
If the pendulum stops, the impulses also cease. The oscillations of 
the pendulum are governed by the pendulum itself. 

Inexactly the same manner, the swings of the triangular trough are 
governed by the trough. The stream of water regulates the swinging of 
the trough, while the construction of the trough itself regulates the water 
inflow. 

A string struck with the fingers and then released is in a state 
of free vibration. The situation is different when a string is drawn 
with a bow. In this case, the string executes self-sustained vibra- 
tions that are saw-toothed in shape. The bow pulls the string along. 
When the displacement reaches a certain limit, the string separates 
away from the bow, returning to its original position. The bow 
again pulls the string along and the process is repeated. In the 
space of the second that the musician draws the bow, the phenom- 
enon is repeated hundreds of times. These are typical self-sustained 
vibrations since they are due to a continuously acting force. The 
string itself controls the vibrations by its elasticity. 

The squeaking sounds emanating from door hinges in need of 
oiling also belong to this class of vibrations. 
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We say that feedback occurs whenever an instrument or machine 
automatically introduces automatic corrections to its action when 
the operating conditions change. The principle of feedback is one 
of the fundamental concepts in automation. 


29. Addition of Parallel Vibrations 


In a number of cases, the problem arises of analysing the mo- 
tion of a body simultaneously executing two vibrational motions. 
Thus, an oscillating pendulum 
may be located on a vibrat- 
ing platform, or it may be 
on a rolling ship. 

If we are concerned with 
vibrations in a single direc- 
tion, the addition occurs as 
shown in the model in Fig. 
48. Two pendulums, in this 
case, Oscillate in parallel planes. 
A light rod lies freely 
on the pendulums and a re- 
cording pen is attached at its Fig. 48 
centre. As an approximation, 
we can assume that the pen will remain in a plane differing little 
from the planes of vibration of the pendulums and that the dis- 
placement of the pen at a given instant will be equal to the al- 
gebraic sum of the pendulum displacements. Another arrangement 
may also be used, e.g., a bal! oscillates on a spring suspended 
from a board and the board, in turn, is attached to a post by 
a spring in such a manner that the ball simultaneously executes 
two different vibrations in a single plane. 

If x, is the displacement of the first vibration in the absence 
of the second, and x, the displacement of the second vibration in 
the absence of the first, then, at each instant, for simultaneously 
occurring vibrational processes, 


X=X,+4%,. 





In the most general case, the component vibrations may differ 
in amplitude, frequency and phase. 

Let us first consider the case when the vibrations have equal 
amplitudes and frequencies, but are displaced in phase. Then, 


x, =A cos of, Xx, = A cos (wt + @) 


and x=x,+x,=2A cos $cos (of +$), 
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where o=, This means that the resultant vibration is also 


harmonic and has the amplitude 
2A cos +. 


Hence, it follows that the amplitudes of vibrations add arithmeti- 
cally when the vibrations coincide in phase and subtract when 
they are opposite in phase (p=180°). In the intermediate cases; 
the amplitude assumes a value between zero and 2A. In particu- 
lar, when m=120° the amplitude of the resultant vibration is 
equal to A. This is illustrated in Fig. 49. 





Another important case is the addition of vibrations having 
different frequencies. For simplicity, let us assume that p=0O and 
the amplitudes are equal. Then, 


x,=Acoso,t, x,=Acoso,t, and 
x= 2Acos 4% t cos Ot. 
In the general case, the vibrational motion obtained when such 
vibrations are added does not exhibit a distinct periodicity with 


respect to the displacement x. However, two particular cases de- 
serve special consideration. 
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First, let us consider two vibrations whose frequencies are close 
to each other. Then, o,—o,<o,+, and the displacement x is 
the product of two cosines, one varying rapidly with time and 
the other very slowly. Hence, 


2A cos a t 


may be considered to be the slowly varying amplitude of vibra- 
tions occurring with an average frequency Ogg = SEOs, The fre- 
quency of the slowly varying amplitude is known as the beat 
frequency. Fig. 50 clearly shows the two frequencies—the basic 
frequency of the vibrations and the beat frequency. 
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Fig. 50 


The second important case is the addition of two vibrations 
when one of the frequencies is a multiple of the other. It is quite 
evident that the resultant vibration will be periodic. If, for example, 
the period of one vibration is 3 sec and that of the other 7 sec, 
the resultant vibration will be repeated every 2! sec. This is shown 
in Fig. 51. 





Fig. 51 


30. Vibration Spectrum 


We have already spoken about vibrations that repeat with preci- 
sion every specific interval of time, but are not harmonic. For 
example, we have considered saw-toothed vibrations. If we ar® 
sufficiently exacting, it turns out that harmonic vibrations, i.e., 
those represented by sinusoids, are encountered in nature and 
engineering much less often than nonharmonic vibrations. 

At the end of the previous article, we noted that the sum of 
two sinusoids is a periodic vibration, even though not a sinusoid, 
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if one of the frequencies is a multiple of the other. Naturally, 
this is true for any number of harmonic vibrations and not merely 
for two. 


The sum of two vibrations having periods T and ta respec- 


tively, is a vibration having a period T. Furthermore, this is the 
period of the vibrations obtained by adding three vibrations 


having periods 7, 4T and ae respectively; also, four vibra- 
tions—with the additional vibration having the period Ee five 


vibrations—with the additional vibration having the period + T; 


etc. Converting to frequencies, this may be expressed as follows: 
The sum of any number of vibrations whose frequencies are mul- 
tiples of w, i.e., the frequencies , 20, 30, ..., is a vibration 
having the frequency o. 

Now, the following question naturally arises: By adding an 
arbitrarily large number of vibrations, whose frequencies are mul- 
tiples of » and whose amplitudes are selected as required, is it 
not always possible to secure any desired vibration, even saw- 
“toothed? Fourier, the French mathematician, proved that this was 
indeed the case. The theorem named after him states that it is 
always possible to select a,, a,, a, ... and @,, @,, @,, .-. in 
such a manner that any periodic vibration having the frequency w 
may be represented in the form of a sum of harmonic vibrations: 


Xx =a, cos (@t+q,) +4, cos (2w/ +-@,) +a, cos (Smt + ,) +... 


The frequency o is called the fundamental frequency, and the 
frequencies 20, 3, ... are the overtones or harmonics (e.g., 
second harmonic, third harmonic, etc.). The closer the curve of 
the vibrations approaches a sinusoid, the smaller the amplitude of 
the harmonics. On the other hand, if the curve of the vibrations 
hardly resembles a sinusoid, the amplitudes of some of the harmon- 
ics will not differ greatly from the amplitude of the fundamental 
frequency. 

Representing the vibration in the form of a sum of harmonic 
vibrations is called spectrum analysis. The spectrum consists of the 
data on the frequencies and amplitudes of the harmonic vibrations 
comprising the vibration of frequency . These data may be pre- 
sented in tablular form. If there are many frequencies, however, we 
usually resort to a graphical means of presentation (Fig. 52). 

The concept of spectrum may be extended to include nonperiodic 
processes. We may speak of the spectrum of elastic vibrations 
groduced by a blow on a table, the spectrum of the report of a 
pun, or the spectrum of an outcry. 
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To make this clear, let us first consider a process consisting 
of periodic, damped impulses. This is not the case of a single 
report or outcry, but of a series of reports or outcries, i.e., re- 
peated at regular intervals of time. Characteristic of this process is 
the rapid damping of such a vibration, whose curve is shown in 
Fig. 53a. The spectrum of this vibration may be established by 
existing means and has the form shown on the right in the figure. 
As was to be expected, we see that the spectrum is composed of 
frequencies that are multiples of the fundamental. It should be 
noted that the spectrum has a maximum, which occurs for the 
eighth harmonic. This is not accidental, for if we examine the 
vibrations depicted on the left in the figure, we see that within 
each individual impulse the damped pulse vibrates at a “frequency” 
that is 8 times greater than the frequency of the fundamental tone 
(Fig. 53a). 

Similar impulses are illustrated in Fig. 536, but in this case 
the frequency is one-half of the above. Compare the spectrum of 
this vibration with the previous one. Since the fundamental fre- 
quency is now one-half of the original, the “frequency” of the 
damped, elementary process (we have assumed that it has remained 
the same) will now be the 16th harmonic of the fundamental 
tone. The distribution of the harmonic amplitudes remains as 
before, but their number in the same interval of frequencies is 
two times greater. o 

It is easy to see now that the spectrum of a nonperiodic proc- 
ess—a single impulse—is continuous. Individual frequencies are 
not discernible (Fig. 53c), but the nature of the spectrum is very 
similar to that considered above. 

A mathematical proof of the above conclusions is contained in 
the theory of Fourier integrals. 


31. Addition of Mutually Perpendicular Vibrations 


To analyse a complex vibration consisting of the sum of two 
mutually perpendicular vibrations, it is best to use an electronic 
oscilloscope. We shall discuss this apparatus in more detail below 
(p. 459). For the present, it is sufficient to note that an_oscillo- 
scope enables us to depict the vibrations of an electron beam in 
two mutually perpendicular directions. The trace of an electron 
beam on a fluorescent screen describes a path that is the result of 
two mutually perpendicular vibrational motions of the beam spot. 

Let us assume that the vibration of the beam trace in the ver- 
tical direction is represented by the relation y=06cos(w¢+54), and 
in the horizontal direction by the relation x=acoswf. To deter- 
mine the nature of the resultant path, we must eliminate time from 
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the above equations and obtain an equation of the form f(x, y)=0. 
Writing the expressions for the displacements in the form 


= = Cos wf, 4 = cos (wf + 5)=cos wf cos6—sin ofsind and re- 
placing, in the second equation, coswt by = and sinw? by 


1—(= 7 we obtain after simple conversion the equation of 
an ellipse rotated with respect to the coordinate axes: 


x? , y® = 2xy 2g oe 
a a ees 6=sin’d. 
Let us now vary the parameters of the vibrations and see what 
happens to the ellipse. If we vary the phase difference, the ellipse 
will change its form and simultaneously rotate (Fig. 54). 





6=90° §=180 
Fig. 54 


When the phase difference is equal to 90° the axes of the ellipse 
coincide with the coordinate axes. If the phase difference is de- 
creased or increased, the el- 
lipse begins to rotate to the 
left or to the right, respec- 
tively, and simultaneously 
contracts. When the phase 
difference is reduced to zero, 
the ellipse degenerates into a 
straight line. The various cases 
can be checked by sub- 
stituting, in turn, the values 
6=0°, 90° and 180° in the 
above equation. 

If the amplitudes of the vibrations in the vertical and horizon- 
tal directions are equal, then for phase differences of 90° and 270° 
the path is a circle. There isa difference between these two phase 
differences in spite of the fact that the paths are identical. In one 
case, the beam moves around the circle in a clockwise direction, 





108 Vibrations [Chap. V 


while in the other, the motion is counterclockwise. To see this, 
let us return to the original equations. We obtain the following: 


for 90° x=acoswt, y=bcos(wt+ 90°); 
for 270° x=acoswt, y=6cos(wt+270°). 


The first pair of equations shows that for increasing time, at /=0, 
the point having the coordinates x =a, y=0 begins moving in the 
direction of negative y, i.e., clockwise. The second pair of equations 
shows that in this case the direction of motion is counterclock- 
wise. 

In viewing an oscillogram, it will be observed that the ellipses 
do not stand still, but slowly shift as though a continuous change 
in phase were occurring. Careful observation shows that the ellipse 
does not rotate, but the curve being traced by the beam spot 
seems to continuously shift from one ellipse to another. This phe- 
nomenon occurs when the frequencies of the vibrations differ some- 
what. In fact, a difference in frequency is entirely equivalent 
to a continuously changing phase difference. Let us assume that the 
frequency of the vertical vibration , is Aw greater than the 
frequency of the horizontal vibration ,. Then, 


o,t-+6=0,t-+(Aot-+ 4), 


where the variable phase difference is within the brackets. 

If the frequencies differ considerably from each other, before the 
beam is able to describe the major portion of one ellipse the 
phase has already changed. As a result, the described curves look 
less and less like ellipses. Examples of these queer curves are 
shown in Fig. 55 and are known as Lissajous figures. The depicted 
curves are for a frequency ratio of 3:4. 


CHAPTER VI 
TRAVELLING WAVES 


32. Propagation of a Disturbance 


Every body is elastic to one or another degree, i.e., every body 
is able to restore itself to its original form after being distorted 
by a force of short duration. This property is responsible for the 
fact that every mechanical action is transmitted by a body with 
finite velocity. If a perfectly rigid rod, incapable of being de-~ 
formed, existed, it would only be able to move as a unit, and the 
action of a force would dissipate in such a body instantaneously. 
If a perfectly plastic body existed, deforming without in the 
least restoring itself to its original form, it would be incapable of 
transmitting any mechanical action whatsoever. 

In an elastic body a disturbance is transmitted successively 
from one particle of a body to a contiguous one. The compression 
produced at the end of a rod due to the blow of a hammer is 
propagated along the body with a definite velocity c. If a bend 
of short duration is created at some point of a rigid body, this 
disturbance will also be transmitted with finite velocity through 
the body. The same is true of every deformation. Propagation 
through a body for various mechanical deformations is usually 
demonstrated by means of a spring (Fig. 56). 





Elasticity of compression and extension is a property of liquid 
and gaseous bodies, as well as of rigid bodies. Hence, these dis- 
turbances may be transmitted in all bodies. However, the disturb- 
ances produced by shearing, torsional and bending deformations 
may be transmitted only by rigid bodies possessing the correspond- 
ing elasticity. 

For compression and extension, the motion of the particles is 
in the direction in which the mechanical action is transmitted. 
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Hence, the propagation of the disturbance is said to be longitud- 
inal. In the case of shearing, torsion and bending, the direction 
of motion of the particles may -form, generally speaking, any 
arbitrary angle with the direction in which the energy is trans- 
mitted. 

We can always select the direction in which the mechanical 
action is being transmitted and then resolve the displacement of the 
particles of the body into three mutually perpendicular components, 
whereby one of the components is in the direction of propagation 
and the other two are in a perpendicular plane. Thus, in the most 
complex case, we can consider the propagation of a disturbance as 
hy of the sum of three motions—one longitudinal and two 
lateral. 

The velocity.of propagation of the disturbance due to an elas- 
tic deformation depends on the mechanical properties of the body. 
As is shown in theoretical physics, this velocity is related to 
other physical constants of the body. Thus, for longitudinal 
waves, it is given by the simple formula: 


1 
c=. 
Qx% 
Here, @ is the density of the body and x is the compressibility. 
A large density leads to an increase in the inertia of the body’s 
particles and, consequently, the velocity of propagation of the 
elastic waves decreases. The small values for compressibility 
indicate that even large elastic forces correspond to only small 
deformations. The smaller the compressibility, the greater the 
velocity of propagation of the disturbance. 

This is the form in which this formula is generally used for 
liquids. Thus, water compresses by 5X107* of its volume for a 
change in pressure of 1 atm. This means that the compressibility, 
equal by definition to 


1 Av ie oi eg OM aS 
u= —357, (see p. 157), is 10°F x 5 x 10-5. 


The density of water is 1 gm/cm*. Hence, for the velocity of 
propagation in water, we obtain 


c?=2 x 10" cm?/sec’, 
c=1,400 m/sec. 


For gases, it is convenient to convert the formula for velocity 
into another form. Since the process of transmitting compression 
in a gas is very rapid, the compression and expansion of a gas 
may be considered to occur adiabatically, i.e, without heat 
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exchange. We shall derive the equation of the adiabatic process 
below (p. 170), from which it is easy to obtain the following rela- 
tionship between the coefficient of compressibility and _ the pres- 


sure of the gas: naz, where y=1.4*, Thus, c= = For an 
ideal gas, the density gat is proportional to the fraction 


a , where p is the mass of a mole of gas andv is its volume. This is 


so because i = const. Hence the velocity of propagation in a gas is 


Va 
c= a-—-,. 
B 


Here, a is a constant whose value is easily calculated by means 
of equations considered below (p. 169). 

Thus, the velocity of propagation of the disturbance due to a 
deformation in a gas, including the velocity of propagation of sound 
waves, which will be discussed in more detail! later on, is proportional 
to the square root of the temperature and does not depend on the 
pressure of the gas. The dependence on the molecular weight is 
interesting. In hydrogen, the velocity of propagation is equal to 
1,263 m/sec, while in air, as is well known, it is 331 m/sec. 

For longitudinal waves propagating in a rigid body, the coeffi- 
cient of compressibility is usually replaced by the modulus of 
elasticity. Since, by definition, the modulus of elasticity 


EA apse, 


Basizt 


Sr, ke , . 3 1 
it is evident that in the absence of transverse motion =F 


for the linear compression is equivalent to the volume compres- 
sion. The formula for velocity may then be written as follows: 


Ve 
c= —. 
Q 


* The equation of the adiabatic process is pu’=const. If p and v are the 
equilibrium values of the pressure and volume for a certain mass of gas, and 
p+Ap and v—dAv the corresponding values at the instant of deformation, then 


(p + Ap) (v—Av)T = pot. 


Ap __ Av\~T__ so. 1(t—1) (40\2 . 2 
Whence, it oH 1——) =l-yot5 a =) >... Disregarding terms 


of higher order in the binomial expression, we obtain 





Av 1 
Ap=—yp > . Hence, ee 
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The table shows the extent of the agreement between calculated 
and experimental values: 











Young’s Modulus E phat al ey ati 
Glass ...... 7.8X 10° kg/cm? 2.4 5,700 5,990 
Stel ...... 2,200 kg/mm? 8 5,200 5,000 
Wood ...... 7.2X10® kg/cm? 0.7 4,130 4,200 
Water (13°C) . . .| x= 4.75107 atm™? Lb 1,450 1,440 


To check the formula for the velocity of propagation of sound, it 
is necessary to use samples in the shape of slim rods. This is due 
to the fact that a more thorough analysis of the problem indicates 


that the formula c= ye is only valid for such bodies. For 


bodies having other shapes, as well as for the propagation of 
sound in acontinuous medium, theory leads to expressions which 
we shall not introduce. 

It should also be noted that the values given in the table are 
only for guiding purposes. The velocity of sound in different types 
of glass, wood, steel, etc., differ considerably. 


33. Generation of Wave Motion 


Sustained vibrations may be applied to a particular point of a 
body or medium in a variety of ways. A force acting periodically 
at some point of a body produces a periodically varying deforma- 
tion whose disturbance is transmitted at a specific velocity from 
one point of the body to another. All the particles of the body 
participate in the vibratory motion. Since the velocity of prop- 
agation is finite, however, the particles of the body are set into 
vibration in consecutive order. If a body is infinitely large, such 
a vibration will advance continuously, forming a travelling wave. 

Infinitely large bodies do not exist. However, the actual length 
of a large body does not affect the nature of the phenomenon, 
for the vibrations do not reach the end in view of inevitable 
energy losses. 

Let us consider a wave travelling in a particular direction in a 
body that for all practical purposes is infinitely large. Assume 
that the particle located at the origin of the coordinate system is 
vibrating in accordance with the equation y—A cosof. Let us 
write the equation of vibration for a particle located along the 
line of propagation of the disturbance at a distance x from the 
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origin. It is not the same as for the particle at the origin because 
this particle began to vibrate after a delay of t=, the time 


required for the disturbance to be propagated the distance x. The 
vibration of particle x, therefore, is shifted in phase with respect 
to the vibration of the particle at the origin. At the instant of 
time ¢, the vibration of particle x will have the same phase as 
the vibration of the particle at the origin at an instant of time 


= earlier. Hence, the equation of vibration for a particle dis- 
placed by a distance x from the origin is 


y= Acos o( i—2), 


where = is the phase shift. 


The abave equation is known as the wave equation. It is valid 
for the vibration of any particle located at any distance from the 
origin. 

Let us assume that the source of the wave is far from the 
observer and that the wave front has long since moved ahead. 
We now consider a portion of the line along the x-axis subject to 
wave motion. At first glance, the introduction of a new concep! 
may appear unjustified. To be sure, vibrations occur at all points 
in this region. However, can we discern how the wave is moving? 
Is it moving to the right or to the left? Careful observation shows 
that the specific character of the wave motion is easily detected. 
If the wave is moving from left to right, the adjacent point on 





—»Motian of the wave Motion of the wave ~— 
Fig. 57 


the right will be delayed in phase with respect to the point on the 
left. In the reverse case, the point on the right will be ahead in phase. 
Fig. 57 shows waves travelling to the right and to the left. Each 
sinusoid represents a snapshot of the wave ata particular instant. 
At each succeeding instant, this sinusoid is displaced in its en- 
tirety in the direction in which the energy is being transmitted. 


5 —3580 
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It is clear from this that the direction of the wave motion 
affects the form of the wave equation. If the wave moves along 
the coordinate axis in the positive direction, a minus sign must 
precede the value of the coordinate x. If the wave moves along 
the coordinate axis in the negative direction, the sign in the 
argument of the cosine must be reversed: 


y=Acos o( t=); y=Acosa( t+), 


pos. direct. neg. direct. 


The wave equation at an instant of time equal to a multiple of 
the period reduces to 


x x 
y= Acoso = =A cos 21. 


The minus sign is dropped since the cosine is an even function. 
From this equation, it immediately follows that the period of the 
sinusoid is Pe 

=CcT. 


This spatial period, i.e., the distance covered by an undulation 
before it repeats itself, is known as a wavelength. We have thus 
arrived at the well-known relation connecting the velocity of a 
wave, its length, and the period of a vibrating particle. 

A number of physical quantities vary sinusoidally in the undula- 
tory transmission of a disturbance through a body, namely, the 
displacement of a particle from its equilibrium position, the veloc- 
ity of the vibrating particles, the pressure and the density. Herice, 
the above wave equation is very general. The quantity y may 
designate any of the enumerated physical quantities, which vary 
sinusoidally when the wave moves in the x-direction. It should be 
noted, of course, that the waves of pressure, velocity and displace- 
ment do not necessarily have to coincide in phase. For example, 
it is clear that the wave representing the velocity of the vibrating 
particles is shifted in phase by 90° with respect to the wave of 
the displacements, for the velocity of a particle is a maximum 
when it passes through the equilibrium position. 


34. Pressure and Velocity of Vibrations 


It is interesting to examine the relationships between the wave 
amplitudes of various physical quantities. For this purpose, we 
shall only consider longitudinal waves propagating in a gas and 
concern ourselves with waves of displacement, particle velocity and 
incremental pressure. Since the theory arose in connection with 
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auditory. waves, the incremental pressure Ap is often called the 
sonic pressure and is designated by p, the symbol A being dropped. 

If A is the amplitude of the displacement wave, then oA is 
the amplitude of the velocity wave.. These two waves are 90° out 
of phase. 

We shall now derive the relationship between the amplitude of 
the velocity of vibrations and the amplitude of the pressure. From 
the general definition of x as applied to gases (p. 111), we obtain 
for sonic pressure the formula 


A 
p=—yP >, 
where P is the pressure of the gas. Using the relation ca, 
we obtain 


p=—cto= 
uv 


It is perfectly natural that a direct connection should exist between 
the incremental pressure p and the relative compression in the 
gas at the same location. 

However, going a step further, the relative compression of the 
volume, “= , can be related to the displacement amplitude of the 
vibrating particles. Let us mark two points, x, and x,, along the 
line of propagation. For a longitudinal wave, changes in density 
occur as a result of displacements in the direction of propagation. 
Let us consider a volume of gas bounded by the cross-sections 
through x, and x,. When the wave moves, the molecules within 
this volume are displaced. However, it is only necessary to con- 
sider the situation at the limiting cross-sections. If the molecules 


of the layer through x, are displaced by y,=A cos « ( —+) and 


the molecules of the layer through x, by Ya= A cos « ( —=:) ; 


then the linear dimension of the volume, x,—x,, changes by the 
amount y, 7 The relative change in length, and hence in vol- 


ume, ise . Going over to the limit, in order to obtain a quan- 
tity deseriptive of a point in space, we obtain 


Av dy = @ 4. x 
oho Asine (2 ~=) 


and for the pressure 
. x 
p=ceAw'sin o (1 —) ‘ 


52 
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This shows that the pressure changes in phase at the rate of the 
particle vibrations in the wave. Aw=u, is the amplitude of the 
velocity of vibration. Thus, p,, the amplitude of the pressure, is 
related to u,, the amplitude of the velocity, as follows: 


Po = QCU,. 


In acoustics, wu is usually measured in cm/sec and p in dynes/cm?. 
Using these units, we obtain p,=4lu, for air at room temperature. 
The quantity ec is called the acoustic or wave resistance. The 
meaning of the designation is evident, namely, the greater the 
resistance the smaller the velocity of the vibrating particles for 
the same values of incremental pressure. 

The acoustic resistance of several materials is given in the table: 














p c 6c 

(gm/cm?) (cm: sec) gm/cm?sec 
Glass ... 2.6 5.5x 108 14105 
Steel ... 7.9 5x 105 40 10° 
Wood ... 0.7 4.2 10° 2.9 108 
Water ... 1 1.44 10° 1.4 108 


35. Energy Flux 


Wave motion transfers energy from one location in space to anoth- 
er. However, it should be kept in mind that every particle of the 
medium is involved in the transmission of energy and each con- 
tinuously vibrates about an invariable equilibrium position. 

Since all the particles of a body are involved in the vibration, 
the vibrational energy of a unit volume is 

Wimax 


2 ? 





where Q@ is the density, i.e., the mass of a unit volume, and 
Umax iS the amplitude of the velocity of vibration. Substituting for 
the latter quantity the familiar expression 


Umax = OA, 


where A is the displacement amplitude and @ is the frequency, 
we can write the density of the vibrational energy of a body in 
the form 
ew?A? 
w=" 3° 
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This energy propagates with a velocity c. The following question 
now arises: What is the expression for the wave intensity, i.e., 
the amount of energy passing in unit time through a unit area 
perpendicular to the direction of propagation of the wave? Instead 
of referring to the intensity of a wave, however, it is more usual 
to speak about the vibrational energy flux, meaning thereby the 
energy passing per unit time (power) through a given area. This 
approach is completely analogous to the analysis of the flow of 
- water in a pipe. In a unit time, a wave traverses a path c and 
transmits the energy contained in a cylinder of length c and unit 
cross-section. Since a unit volume contains the energy w, the energy 
in the above cylinder is wc. This is precisely the meaning of wave 
intensity: 

T=we. 


We see that wave intensity has the sense of energy flow through 
a unit area. This was first noted by N. A. Umov in his theoretical 
work on energy motion in bodies. 

Until now, it has been assumed that the wave motion propa- 
gates along a straight line. Such an assumption is useful when in- 
vestigating disturbances travelling along rods, strings, air columns, 
etc. However, we are also interested in investigating cases involv- 
ing three-dimensional wave motion. 

To describe a three-dimensional wave, we must know how the 
wave front moves. The wave front at a particular instant can be 
determined if all the points in space having the same phase of 
vibration are given. Noting the successive positions of this constant- 
phase surface, i.e., the wave front, we obtain a clear picture of 
the nature of the wave motion. 

Generally speaking, this surface may have any shape. In this 
case, then, what is meant by the direction of the wave propagation? 
It is natural to understand this direction to mean the normal to 
the wave front. 

If the medium is perfectly homogeneous and the wave emanates 
from some point in the medium, the wave front is spherical. Such 
a wave propagates along the radii from the centre. At large dis- 
tances from the centre of radiation, large portions of the wave 
front will appear to be in a plane—within experimental accuracy. 
In this manner, the concept arises of a plane wave propagating in 
a direction normal to the wave front. If the wave radiator is of 
linear shape, a cylindrical wave propagating along the radii of the 
cylinder is generated. Various types of waves are shown in Fig. 58. 

Disregarding all energy .losses that occur during the motion of 
a plane wave, we obtain that the quantity of energy passing 
through successive positions of the constant-phase surface remains 
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unchanged. Hence, the intensity of a plane wave will not change 
during the process of propagation. The situation is different, how- 
ever, with regard to spherical and cylindrical waves. Since the 
constant-phase surfaces increase in area as the square of the distance 
for spherical waves, and linearly as the distance for cylindrical 





Fig. 58 


waves, the intensities of these waves are inversely proportional to 
the square of the distance and to the distance, respectively. Only 
in this manner can the law of conservation of energy be satisfied. 

The intensity of a wave is proportional to the density of the 
vibrational energy, which is proportional to the square of the 
amplitude. Hence, the amplitude of a spherical wave is inversely 
proportional to the distance from the radiating centre, and the 
amplitude of a cylindrical wave is inversely proportional to the 
square root of the distance from the linear radiator. Thus, 
y= Cos @ (*—=) for a spherical wave and y= za cos o(t—t) 
for a cylindrical wave. Here, the distance r, just as previously x, 
is measured along the direction of wave propagation. 

Let us assume that a source of vibrations having a frequency of 1 kc/s is 
placed under water. The energy flux /=1 watt/em*%, Let us calculate the dis- 
placement amplitude A of the water molecules, their acceleration B and the 
amplitude @A=u, of the vibrational velocity. 

From the formulas of previous articles, it follows that 


u 2l : B= 2h mud =7 2 cm, 
A=> 90 10? cm; Bao x10 y= 2x10" 


pe sec’ 
23 m =] &@ 
For water, c= 1,450 — and g@=1 cate Hence, 
a . ~ PL ~jQ5m 
A=1,9X107? cm; B=740 G3 aye12 =. 
For the same energy flux and frequency of vibration, we obtain the follow- 


ing results in air, where c=330 = and g=1.293x10~%: 


A=0.11 cm; B=43x10' = = 43,000 =; uy =700 =, 
sec 


sec? sec? 
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36. Damping of Elastic Waves 


In actuality, waves propagating in a medium (solid, liquid or 
gas) decrease in intensity considerably more rapidly than indi- 
cated by the inverse-square law. This is due to losses in mechani- 
cal energy, i.e., transformation of mechanical energy into heat. 

The relation expressing the decrease in intensity of some partic- 
ular radiation in passing through a medium may almost always 
be obtained by reasoning as follows (for any medium and any 
radiation): If a wave passes through a layer of thickness dx, the 
intensity loss should be proportional to the intensity of the inci- 
dent wave and the thickness of the layer, i.e., d/=—pldx. 

This equation may be integrated. Assuming that the intensity 
is equal to 7, at the point x=0O and to / at the point x, we 
obtain a relation that. is valid for finite distances: 


~ 


x 
F=—pS dx, ie tales 
Qo 


oF 


Thus, the intensity of the wave decreases exponentially. 

In acoustics, it is convenient to use the concept of amplitude 
damping. Since the intensity is proportional to the square of the 
amplitude, the amplitude damping is expressed by a relation that 
differs from the above only in that the coefficient of damping (or 
absorption) is one-half of the value given there: 


1 
—— pe 
A=Ae *. 


Let us examine the absorption coeficient p {or +h somewhat 


closer. It is measured in reciprocal centimetres (for the exponent 
must be a dimensionless quantity) and is equal to the reciprocal 
of the thickness for which the intensity or amplitude of the radia- 
‘tion decreases by the factor e. : 

Naturally, the exponential damping relation is only a partial 
solution to the problem of the absorption of elastic waves by a 
medium. The determination of the dependence of the absorption: coef- 
ficient on the properties of the medium and the radiation frequency 
is a more formidable aspect of the problem. * 

It has been found that for many materials the damping.of an 
elastic wave (most of the available data being for sound waves in 
air) increases with the frequency of the vibration, The relation for 
the absorption coefficient has been determined to be 


p=ao’*. 
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For air, a=4x107** sec*/em. Thus, over a distance of 1 km, a 
plane wave having a frequency of 100 cps decreases by a factor 
of ~1.015, while. a very high audio frequency of 20,000 cps de- 
creases by a factor of 10°! Ultrasonic vibrations are damped so 
rapidly that their transmission over a distance of more than sever- 
al hundred metres is completely impractical. 

However, the monotonic relationship between absorption and 
frequency is not always satisfied. Some materials exhibit selective 
absorption of sound in a relatively narrow frequency band. Thus, 
the absorption of ultrasonic waves by carbon dioxide is a maximum 
at frequencies near 277 kc/s. The parabola calculated in accordance 
with the formula »=aw’ closely matches the experimental data 
in all regions except in the band indicated above. At frequencies 
close to 277 kc/s, the absorption is about 20 times greater than 
that calculated assuming a parabolic relationship. 

The dependence of the absorption coefficient on the properties 
of the medium can be expressed as follows for longitudinal waves 
in gases and liquids: the absorption coefficient is inversely propor- 
tional to the cube of the velocity of the elastic wave and directly 
proportional to the kinematic viscosity. As a result of this strong 
dependence on the velocity of propagation, and the fact that the 
kinematic viscosity of air is large, the absorption of sonic and 
ultrasonic waves in a liquid is about 1/1,000 of that in air. This 
means that for the same frequency elastic waves will propagate 
1,000 times further in water than in air. 

The absorption of transverse waves in solid bodies is also strongly 
dependent on the properties of the body. Thus, the absorption in 
rubber, cork and glass is, respectively, 13,000, 8,500 and 130 times 
greater than in aluminium. 

Due to their complexity, we shall not go into the theories of 
elastic wave absorption in bodies. 


37. Interference of Waves 


If there are several sources of waves instead of just one, then 
each point of the medium is simultaneously subject to several wave 
motions. It turns out that it is always possible to consider the 
vibration of a physical quantity due to the action of several 
waves as the sum of the vibrations that would occur if each wave 
acted independently. 

Let us assume that spherical waves emanate from two points 
located at a certain distance from each other. By means of the 
wave equation, we can determine the value of the vibration ampli- 
tude at any instant of time for any neighbouring. point. If the 
point in question is located at a distance r, from the first source 
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of waves and at a distance r, from the second source, the vibra- 
tion there is represented by the formula 


y =A cos 2n (ve—4) -+-A cos 2m (vt—+) : 


The result obtained by adding two vibrations differing only in phase 
is, as we know, also a harmonic vibration whose amplitude is 


2A cos s which can be seen to depend on the phase difference 


between the component vibrations. The phase difference 6 is equal 


in this case to 
20 (25%) ‘ 


Thus, generally speaking, all points of the wave field under 
consideration will be in vibration. However, the amplitudes of these 
vibrations will be different at different points. Two extreme cases 
deserve attention. First, let us consider the points at which the 
component vibrations annul each other. These points satisfy the 
condition 

On A = (2k +1) x, 
where k=O, 1, 2, 3, ..., i-e., the phase difference is equal to 
an odd multiple of x. On the other hand, if 


on ats Qk, 


ie., if the phase difference is equal to an even multiple of x, 
the amplitudes of the vibrations will add arithmetically. Thus, in 
this case, the amplitudes reinforce each other to a maximum degree. 

The difference r,—r, may be called the path difference of the 
waves and this term needs no further explanation. The conditions 
of maximum and minimum amplitudes may be formulated some- 
what differently by means of this concept. The maximum condition 


rr, =n 
states that the path difference between waves arriving at a given 


point must be equal to an integral number of wavelengths. The 
minimum condition 


r—r,= ek +1) 


states that the path difference must be equal to an odd number of 
half-wavelengths. These conditions are very easily visualised as 
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follows: The waves reinforce each other when one crest is superim- 
posed on another, and annul each other when a crest is superim- 


y d=32\ 


- d=5/2A 
“ d=2Aa 





Fig. 59 


posed on a trough or 
node. 

The superposition of 
waves, i.e., the addi- 
tion of their amplitudes, 
leads to interference. 

From analytic geomet- 
ry we know that a hyper- 
bola is a curved line 
satisfying the condition 


~d=-32a that the difference of the 
s-2a distances from any point 

“ d=-5f2a on the curve to two foci 
d=-3A% is aconstant. If we pass 


a plane through the point 
sources and note in the 


diagram points of maxi- 
mum reinforcement and those of wave annulment, the points 
will fall on hyperbolas. The corresponding curves are shown in 
Fig. 59. Such a picture can be easily observed on the surface of 
water when two sources sending out ripples from neighbouring 
points set up an interference pattern. 
We can use the above method to analyse the interference of any. 
number of wave sources. 


38. Principle of Huygens-Fresnel. 
Reflection and Refraction of Waves 


The complete equality of all the vibrating points of a wave 
field is striking. They differ only with respect to phase. As a result, 
it is natural for the following idea to emerge: It should be possible 
to consider any point of a wave field as an independent source of 
spherical waves. 

The validity of this idea first formulated in 1690 by Christian 
Huygens, may be tested by attempting to construct a wave front 
from data on a wave field on a boundary surface. It is necessary 
to take into account that there will be interference between the 
individual spherical waves (also called elementary waves or wave- 
lets). Huygens’ principle, supplemented by Fresnel, shows that 
such a construction is possible. 

What is the significance of this principle? Let us assume that 
the wave falls on an opaque screen having several apertures. By 
means of the principle of Huygens-Fresnel, we can map the wave 
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field beyond the screen without knowing anything about the sources 
of the field. It is sufficient to know the intensity of the field in 
the plane of the screen and assume that a spherical wave propa- 
gates from each point on the screen. The amplitude of the wave 
at any location in space is determined by adding all the 
wavelets coming from the apertures in the screen. 

Postponing consideration of the problems related to the passage 
of waves through a screen (problems which are mainly of interest 
in connection with light waves), we shall now apply the principle 
of Huygens-Fresnel to the explanation of the phenomena of wave 
reflection and refraction. 

Let us consider a portion of a plane wave incident on the 
boundary between two media. As is well known, a wave of any 
origin is reflected at an angle equal to the angle of incidence. But 
why should this occur? Huygens’ principle gives the explanation. 
Every point on the boundary between the media may be considered 
as a wavelet source. The first wavelet emanates from the point 
first reached by the incident wave. Successive points on the bound- 
ary will then be excited, the last point to start vibrating being 
the one last reached by the incident wave. Fig. 60 shows the 





Fig. 60 


positions of the wavelets for the instant of time when the incident 

. wave reaches the last point. The wave front generated by the 
wavelets forms an angle with the boundary equal to that of “the 
incident wave. Thus, the propagation velocities of the -incident 
wave and the reflected wave are the same. This means that the 
radius of the largest sphere rust be equal to the path traversed 
by the incident wave from the instant the first point was excited 
to the instant the last point was excited. 
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In exactly the same manner, the wave 
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of lesser density, it will be 
deflected away from the nor- 
mal. The ratio + =n is known 


; Cs | a 
as the refractive index. aR 
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etry of reflection and = re- «gsricmeiiys 
fraction may appear to be a 
somewhat uninteresting appli- Fig. 63 
cation of the theory. How- 
ever, the wave theory enables us to do considerably more, namely, 
ascertain the relative proportions of the: reflected and refracted 
waves as functions of the properties of the media whose inter- 
face is being considered. In order to simplify the calculations, we 
shall limit ourselves to the simple case of the normal incidence 
of a longitudinal wave on the interface of two media. The nature 
of the proof, however, is the same for all conceivable cases. 
In a discussion of this type, the following is axiomatic. At the 
boundary between two media, neither wu, the velocity of the 
vibrations of particles, nor p, the incremental pressure, can change 
abruptly. It is intuitively evident that it cannot be otherwise, but 
this can be shown rigorously on the basis of fundamental laws of | 
physics. 2 oe 
On one side of the boundary, there are waves having the in- 
stantaneous velocities u;,,;4 and U,.,..;. On the other side of the 
boundary, there is also the wave having the instantaneous velocity 
Uresract: Lhe continuity of velocity yields the condition: w;,.j4+ 
+ Ure prect =Uresracts the continuity of pressure yields: w;,,;¢0,C,+ 
+ Ure ptect O1€1 = Ure sractQe©,. However, examining these two equations, 
we see that they are incompatible since e,c,#0,c,. How is this 
to be explained? The answer is that we have forgotten that the 
instantaneous values of the velocities and pressures are vector 
quantities and that even in the simple case when the displace- 
ment vectors are in a single plane the amplitudes may differ in 
sign. It can be seen that the equations become compatible only if 
the amplitudes of the velocity and pressure for the reflected wayes 
are given opposite signs. The equations of continuity are then 
written in the form ine 
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We leave it to the reader to show that all other combinations of 
signs will fail to make the equations compatible. 

Since the amplitudes are positive quantities, the sum must be 
greater than the difference. Hence, the first pair of equations is 
valid when Q,c, >,¢, and the second pair is valid for the reverse 
case. The first pair of equations arises when all the vibration 
velocity amplitudes are in one direction and the phase of the 
reflected pressure wave differs by 180°, i.e., the amplitude of the 
reflected wave is oppositely directed with respect to the incident 
and refracted waves. The second pair corresponds to the reverse case. 


Pie > Pata 101 < Pale 
Velocity wave Pressure wave Velocity wave Pressure wave 
incident > . incident > incident > incident > 
reflected + reflected <- reflected <— reflected — 
refracted > refracted > refracted > refracted — 


This interesting phenomenon of amplitude vector reversal in 
reflection may be described as a one-half wavelength loss or a 180° 
phase jump. Thus, the change of sign in the wave equation 
y= Acoso t—= , where y is any physical quantity, may be 
obtained by introducing a 180° phase shift in the argument of the 
cosine. On the other hand, a 180° shift in phase is equivalent to 
displacing the wave distribution by one-half wavelength. 

Thus, at the interface of two media, the incident and reflected 
waves act either to reinforce each other or to annul each other to 
the maximum extent possible. 

It should be recalled that in reflection a one-half wavelength 
loss occurs for the vibration velocity wave when it enters a medium 
of greater resistance (sometimes inaccurately expressed as a medium 
of greater density). The displacement wave is inseparably linked 
with the vibration velocity wave and also suffers a one-half wave- 
length loss. 

In entering the second medium the wave does not execute 
a phase jump. 

Solving the above equations simultaneously, we obtain the expres- 


U, 
sion for the reflection coefficient r, ic., —tefect, 


us: 
Thus Qi —Os6s incident 


~~ Ql1+Q:62” 
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where r is always>0. Similarly, the refractive index g, i.e., 
Lrefract : is 
“incident 20,¢; 

b= oe + en, ‘ 

The wave resistance in air is much different from that in solid 
bodies. As indicated above, ec=41 for air, while for steel (9= 
=7.9gmjcm* and c= 5,000 m/sec) ec = 40 x 10°. Thus, r= 0.99999. This 
means that sound incident from air on steel is practically com- 
pletely reflected and in effect does not penetrate the latter. It can be 
easily calculated that at the boundary between air and water 
r= 0.9997. 


40. The Doppler Effect 


Until now it has been assumed that the wave source and the 
receiver (i.e.,the observer) were both stationary with respect to 
the medium in which the wave was propagating. Various effects, which 
were first noted by Doppler (1842), occur when the source or the 
observer, or of course both, move with respect to the medium. 
They consist basically in the fact that when the wave source 
moves the observer measures the vibration frequency v’ and when 
the observer moves he measures the vibration frequency v”. These 
frequencies differ from each other and from the frequency v_ that 
is measured when the observer and the source are stationary. 

In considering the Doppler effect, it is necessary in the first 
place to note that the wave leaving the source propagates entirely 
independently of the motion of the source and the observer. There- 
fore, in moving relative to the medium, the source or the observ- 
er may approach or recede from the moving wave. 

Why does such motion lead to the measurement of a frequency 
that differs from its “real” value? This is because the observer 
determines the vibration frequency as the number of waves enter- 
ing his apparatus per unit time. On the other hand, the formula 


v => gives the number of waves emitted per unit time. If the 


observer moves toward the source with the velocity u, then in 
‘1 sec the number of waves that he measures is not v, but a num- 
ber larger than this. Moreover, the ratio of c-++u, the relative 
velocity of the wave and the observer, to c is the factor by whtch 
the measured value is larger. Thus, 

vi ctu, F u 

Sots weav(1+¢). 


c 





If the source moves toward the receiver, the observer will again 
register a larger number of waves than when the source and the 
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receiver are stationary. However, in this case, the reason for the 
increase is different. 

This is not evident at first glance. The motion of a source 
having a fixed frequency of vibration leads to a change in the 
distances between points of equal phase of the wave. If the first 
case is considered to be crudely analogous to the motion of an 
observer toward a column of athletes running at equal velocities 
and maintaining the constant distance 4 between them, then the 
second case clearly requires a different interpretation. This case 
can be visualised as the slow displacement of the line of start. At 
equal intervals of time, the runners jump from an automobile 
moving along the track, which leads to a change in the distances 
between them. This distance becomes A”, not 4’. If the line of 
start (the source) is displaced in the direction of the observer and v 
runners start each second, then in 1 sec they are distributed over 
a distance given by c—u. Thus, the interval between runners 


(wavelengths) is A” =°—". The frequency at which the runners 


moving with velocity c cross the finish line, i.e., the vibration 
frequency perceived by the observer, is 





” Cc. 
Vea W=v 





“? 


[oe 


Both of the above formulas are also valid when the source and the 
observer are moving apart. In this case, it is merely necessary to 
reverse the sign of u. 

Thus, it has been shown that when the source and the observer 
approach each other the measured frequency of vibrations radiated 
by the source increases. When the source and the observer move 
apart the frequency decreases. 

A well-known example of the Doppler effect is the change in 
sound of the whistle of a locomotive as it passes an observer. When 
the locomotive approaches the observer the frequency of the sound 
is higher than the real frequency. The pitch changes abruptly when 
the locomotive sweeps past him. In receding, the frequency of the 
sound perceived is lower than the real frequency. For a train 
moving at a velocity of 70 kmjhr the jump amounts to~12 per 
cent of the real frequency. 


CHAPTER VII 
STANDING WAVES 


41. Superposition of Two Waves Travelling 
in Opposite Directions 


Let us assume that two plane waves having exactly the same 
characteristics are moving in opposite directions. We are interested 
in the resultant vibrational motion of the medium in which the 
waves are propagating. 

As indicated above, a difference in the direction of propagation 
is taken into account by a difference in the coordinate signs in 
the wave equation. The resultant displacement should, therefore, 
be given by the expression 


y=Acoso(t—=)-+Acosa(¢+ 2) =2A cos cos wt = 


=2A cos ae cos wt. 


This result is very interesting, for the sum of two travelling 
waves has not yielded wave motion. The formula obtained indi- 
cates the presence of vibrations of amplitude 2A cos, whose nu- 
merical value depends on the location in space. We call this pe- 
culiar vibrational state of the medium a standing wave, which 
arises whenever two identical travelling waves move in opposite 
directions. It should be emphasised that a standing wave is not 
a wave in the usual sense. A travelling wave transfers energy from 
one point to another, but this is in no way true of a standing 
wave. A travelling wave can move to the right or to the left, but 
a standing wave has no direction of propagation. The adopted des- 
ignation merely characterises the vibrational state of the medium. 

What are the characteristic features of this vibrational state? 
In the first place, we see that not all the particles of the medium 


vibrate. At the points in space satisfying the condition xa, 
ae ae .., the vibration amplitude is equal to zero. These points 
are known as the nodes of the standing wave. The distance be- 
tween two adjacent nodes along the x-axis, the direction of propa- 
gation of the travelling waves, is equal to one-half wavelength. 
Between every two nodes is a point that vibrates with a maximum 
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amplitude of 2A. Such points are called the antinodes of the stand- 
ing wave. 

Fig. 64 shows the vibrational state corresponding to the stand- 
ing wave at several successive instants of time. We see that the 
adopted designation is fully justified. At each instant a wave can 

be seen, but the wave does not move. 
9 A. series of consecutive snapshots 
will show that the points of inter- 
section of the wave with the ab- 
scissa, i.e., the nodes, remain fixed. 
The wave is stationary and the only 
change occurring between snapshots 


XY NY is a change in the magnitude of 
the displacements. At a certain in- 
stant, all points of the medium are 


=; motionless. After this instant of 
time, points that previously diverged 
upwards will now diverge down- 
wards, and vice versa. It is evident 


~¢ that this picture has nothing in 
common with the travelling wave 
shown in Fig. 57, where two com- 
parable “snapshots” are depicted. 


x Chere the wave is seen to move. 
At each successive instant, the max- 
ima and minima of the wave pass 
to new locations. 

We stated that no energy trans- 
fer occurs in a standing wave. Then 
how can we describe, in terms of 
energy, the processes occurring in 
this peculiar vibrational motion? 
Clearly, the energy of a standing wave (for some region in which 
it exists) isa constant quantity. At the instant when all the par- 
ticles are passing through the equilibrium position, all the energy 
of the vibrating particles is kinetic. On the other hand, at the 
position of maximum deviation of the particles from the equilib- 
rium position, the energy of all the particles of the body is potential. 

A standing wave is a very important vibrational process. Stand- 
ing waves of different types arise in bodies of limited dimensions 
through which elastic waves are propagated. This is because elastic 
waves are reflected back from the boundary into the body. A com- 
plex vibratory state arises in the finite body which is the result of 
tne superposition on the source wave of all the waves that were 
reflected irom the walls. Several typical cases will now be considered. 


y 





Fig. 64 


Sec. 42] Free Vibrations of a Rod 131 


42. Free Vibrations of a Rod 


By means of a blow or other means, it is possible in any solid 
rod to excite a longitudinal wave that propagates along the length 
of the rod. From the opposite end of the rod, this wave is re- 
flected and in this manner the entire rod is put into a vibratory 
state represented by a standing wave. The state will be one of 
free vibrations since it arises as the result of an impulse of short 
duration and continues without the action of external forces. We 
can predict the behaviour of these free vibrations if we know the 
length of the rod and how it is fixed. These data are known as 
the boundary conditions. It will be found that a node of the stand- 
ing wave is located at the point where the rod is fixed and an 
antinode of the standing wave is located at the free end. 

We shall now consider several modes of excitation of free, 
longitudinal vibrations in a rod of length L. 

A Rod Fixed at Both Ends. In this case, nodes of the displace- 
ment wave are formed at the ends of the rod. Since the distance 
between nodes is equal to one-half wavelength, the wavelengths 
that are possible, in terms of the length of the rod, are given by 


the condition L=nk, i.e., haa, where n is any integral 


number. 
Using for the velocity of an elastic wave the expression 


c= Ve, and recalling the relationship between frequency and 


wavelength, we obtain the expression for the natural frequencies 
of the free longitudinal vibrations of the rod: 


y =e 2 
n ~~ OL e° 


The qualitatively new content of this result should be noted. 
A solid body does not have one, but a multiplicity of natural 
(characteristic) frequencies of vibration. Hence, a rod can execute 
a variety of free vibrations. It is also possible for a rod to per- 
form nonharmonic vibrations having any arbitrary spectrum * con- 
sisting of the frequencies v,,. 

The frequency v, is the fundamental frequency of vibration of 
the rod. It corresponds to the vibratory motion for the condition 


L=4. This means that for the fundamental vibration an. antinode 


* The word “spectrum” is used quite often in physics to denote a sef of par- 
ticles having different velocities, masses, etc., or a set of waves having diter- 
ent wavelengths (frequencies), etc. 
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of the standing wave is at the centre of the rod and there are no 
nodes between the ends. The vibrations of the second overtone 
(second harmonic) correspond to the condition L=A. Now, there 
is a node at the centre of the rod. If the third harmonic is ex- 
cited, there will be two nodes between the ends of the rods, etc. 


Example. For a stee) rod (g@=7.7 gm/cm* and E=21,000 kg/mm?) whose 
length is 7 metres, the fundamental frequency is v,=365 cps. 


A Rod Free at Both Ends. If a rod is suspended by a thin 
string and then vibrations excited in it, the resultant standing 
wave must satisfy the condition that antinodes are located at both 
ends of the rod. Just as in the previous case, the connection be- 
tween the length of the rod and the wavelength is expressed by 


the relation: L=nk. Hence, the formula for the natural frequen- 


cies will also be the same. 

The difference between this case and the previous one is in the 
distribution of the nodes and antinodes. For the fundamental vi- 
bration, the centre of the rod is at rest (node). If the second har- 
monic is excited, there will be an antinode at the centre; one- 
quarter wavelengths away—nodes; and at the ends—antinodes. 

A Rod Fixed at One End. In this case, there will be a node at 
the fixed end and an antinode at the other end. For the funda- 
mental vibration, the rod has a form’ corresponding to one-quar- 
ter of a period of a sinusoid. Since the distance between a node 


and an antinode is equal to the relationship between the 


wavelengths and the length of the rod is given by the condition 
Lan, where n=l, 3, 5,... 


. The natural frequencies of the vibrations of such a rod are 
given by the formula 


E 
v.=tV = (n=1, 3, 5, ashes 


In the first two cases, the frequencies are related to each other 
as the whole numbers. Here, they are related to each other as the 
odd numbers. 

A rod fixed at the centre will have a node at the fixed point 
and antinodes at ihe ends. The problem is essentially the same as 
above. 

The boundary conditions used in the consideration of the vibra- 
tory state of a rod are an extreme case of the boundary conditions 
for reflected waves, considered on p. 125, As was explained ear- 
lier, reflection from a boundary separating one medium from an- 
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other medium of greater resistance is accompanied by a loss of one- 
half wavelength in the displacement wave. If the rod is fixed, the 
wave does not penetrate the second medium at all. In this case, 
the second medium can be said to have an infinitely large resist- 
ance. The coefficient of reflection is equal to unity and the re- 
flection is accompanied by a loss of one-half wavelength. It is not 
difficult to see that this corresponds to the presence of a node at 
the boundary between the two media. The reflection of the wave from 
the free end of the rod corresponds to ‘reflection from a medium 
having zero resistance. A reflection coefficient equal to unity and 
the absence of a half wavelength loss leads us to conclude that 
an antinode must exist at such a boundary. 

Free longitudinal vibrations may also be excited in columns of 
liquid and columns of gas. 

Free lateral vibrations are easily excited in a string under ten- 
sion. The distribution of the nodes and antinodes will naturally be 
the same as for a rod fixed at both ends. The set of frequencies 
is expressed by a formula analogous to that derived for the rod, 
the oniy difference being that in the expression for the velocity of 
the lateral wave it is necessary to replace F by the tension, i.e., 
the force stretching the string divided by the cross-section. 


43. Free Vibrations of Two-Dimensional 
and Three-Dimensional Systems 


In rods, strings and air columns, the constant-phase surfaces 
of equal phase consist of parallel planes. The vibratory state may 
be conceived as the result of superimposing plane waves extending 
along a single line. However, more complex cases are possible. 
Thus, we can have vibratory motion encompassing a two-dimen- 
sional region, e.g., plates and membranes, or encompassing a body 
whose three dimensions are of equal order of magnitude. 

The vibration of elastic and rigid diaphragms is a two-dimen- 
sional problem. A plate fixed at its edges will have a diflerent mode 
of vibration than a plate fixed at a single point or not fixed at 
all. Apart from the vibration of rigid plates, vibration of stretched 
nonrigid films, e.g., rubber and soap films, is also encountered. 

In principle, the general behaviour of the free vibrations in this 
case does not differ from that already considered. Since this $s a 
two-dimensional problem, the nodes and antinodes will now con- 
sist, in general, of curved lines. For example, the fundamental 
vibration of a circular plate fixed along its circumference has a 
single antinode (a point in this case) at the centre of the circle, 
i.e., the central point vibrates with maximum amplitude. As we 
move toward the edge, where a nodal circumference is located, the 
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amplitude gradually decreases while maintaining circular symmetry. 
This is the simplest case, namely, vibration of the fundamental 
(lowest) frequency. A membrane may be excited at a higher har- 
monic; in such a case the surface is broken up by nodal lines. It 
turns out that nodal lines in a circular plate may have a circular 
form or consist of diameters passing through the centre. 

The demonstration of nodal lines by the Chladni method (named 
after the scientist who proposed it) is an effective and simple ex- 
periment. A plate sprinkled with sand is put into a vibratory state 
by means of a blow or a fiddlestick. The sand rolls away from the 
antinodes and gathers along the nodal lines. Fig. 65 shows several 
Chladni figures. 





Fig. 65 


The vibratory state of a solid three-dimensional body is, of 
course, the most complex. We shall avoid consideration of this 
phenomenon in a body of complex form and restrict our study of 
such free vibrations to a right-angled parallelepiped. If the stand- 
ing waves in such a body were only due to the superposition of 
waves travelling parallel to an edge of the parallelepiped, the nat- 
ural frequencies of the vibrations would be limited to the values 


2? Di? GU," 


where n,, n,, n, are arbitrary whole numbers, and /,, /,, /, are 
the lengths of the edges of the parallelepiped. 

However, the waves propagating in the body may form, in gen- 
eral, any angle with the boundaries. In this case, the standing 
waves are formed after a number of reflections, when the beam 
returns to the exact point from which it left. Such an undulatory 
process may be conceived as the superposition of waves of different 
frequencies travelling along different edges of the parallelepiped. 
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Their natural frequencies of vibration are: 


_ nye __ Nee = 
‘=o Maroy v= 


nse 
2i,* 


and the wave numbers are 


ens — Ja — ts 
R=o > R= 9 k,=o- 


In superimposing the waves, k is determined by addition, but it 
should be noted that the addition is vectorial. Thus, 


ka BRE, i.e., oy 2 


It is evident that the frequencies of vibration for the simple cases 
of wave propagation parallel to the edges of the body are also ob- 
tainable from this formula; in such a case only one of the three 
whole numbers in the formula is different from zero. 

The vibration spectrum of a three-dimensional body is depicted 
in Fig. 66 in three-dimensional space, which may be called fre- 
quency space or inverse space. Here, the quantities oa 7 x 


are plotted, respectively, along the three axes. Each node in the 





Fig. 66 


lattice (inverse lattice) thus formed represents one of the natural 
frequencies of vibration of the body for the numbers n,, n,, n,. 
The radius vector drawn to a node of the lattice in the inverse 
space represents a possible vibration frequency. A sphere of radius 
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v includes all points corresponding to frequencies less than v. The 
volume of such a sphere is equal to Aw and the volume of each 


cell of the inverse space is equal to ($) fe. where v is the volume 


of the body. Therefore, the number of free vibrations of a body 
with frequencies less than v (the number of nodes in an octant of 
the sphere) is expressed by the formula 


4 vs 
3 MUX 


This interesting relationship shows that the number of natural 
frequencies increases sharply as the band of frequencies being con- 
sidered increases. For high frequencies, the discrete character of 
the spectrum becomes blurred, for the frequencies are very close 
to each other. 


44. Forced Vibrations of Rods and Plates 


If the vibration of a rod, plate or other body does not take 
place in vacuum but in some medium*, namely, liquid or gas, 
then a fraction of the intensity, depending on the ratio of the 
wave resistance of the contiguous media, is transferred from the 
vibrating body to the medium. This idea can briefly be expressed 
as follows: a vibrating body radiates energy. Due to radiation, 
the free vibrations of a rod, string, etc., are rapidly attenuated. 
If it is required that the body be a constant source of radiation, 
the vibration must be excited by an external source. Just as in 
the case of particle vibrations, the energy may be provided either 
by means of self-sustained vibrations or by producing forced vibra- 
ions. 

Depending on the method of providing the external energy and 
on its point of application, one can excite, generally speaking, 
one or more of the natural frequencies of a body capable of vi- — 
brating. One can, for example, produce forced vibrations in a string 
under tension in the following manner. An electromagnet fed by 
sinusoidal current from an audio generator is fixed about a steel 
wire. The vibrations of the wire under the action of the periodi- 
cally varying external lateral force become perceptible only at 
resonance. By varying the tension of the wire and the external 


* One must become reconciled to the fact that vibration of a body is used 
in two serses—vibration of a body as a whole and vibration of the particles 
of a body with respect to one another. PS oe . 
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frequency, one can show that the wire will vibrate at the funda- 
mental frequency as well as at various overtones. ; 
The production of forced vibrations (standing waves) in piezo- 
electric plates and ferromagnetic rods is of great practical impor- 
tance. Such vibrating bodies are generators of ultrasonic waves. 
Ferromagnetic bodies may elongate or shorten under the action 
of a magnetic field. The theory of this phenomenon is complex 
and will be.treated only briefly in 





this book. For the present, it is ‘& jg m 
sufficient to illustrate how the —~* Cofcast) 
length of a ferromagnetic rod ah g pee 
depends on the intensity of the § 

field. This is done in Fig. 67, =” 

which shows that nickel and an- P-gp. 

nealed cobalt shorten in fields of ¥ 

any intensity, cast cobalt shortens 8% -30 

in weak fields but elongates in § gy 

strong fields and, finally, ironelon- 8 1009 jog 
gates in weak fields and shortens Intensity 0 magnert field 
in strong ones. In any case, a fer- AE 
romagnetic rod can execute forced Fig. 67 


vibrations when placed in an al- 
ternating magnetic field. For this purpose, the rod is usually 
placed in the core of a transformer fed by alternating current. In 
order for the standing wave in the rod to be of sufficient inten- 
sity, it is necessary to operate under conditions of resonance, i.e., 
the frequency of the alternating field should coincide with the rod’s 
natural frequency of vibration. 

Since the rod is fixed at the centre, the natural frequency of 
vibrations is 


ast 
7 ope 


and the rod can vibrate only at the frequencies of the odd har- 
monics. Substituting the numerical values of the physical constants, 
the fundamental frequency for nickel turns out to be equal to 


ya kc/s (where / is in centimetres). 


Thus, a rod having a length of 40 cm will vibrate at a funda- 
mental frequency of 6 kc/s. 

A piezoelectric crystal is most commonly: used as a source of 
ultrasonic vibrations. 
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45. Piezoelectric Vibrations 


Any crystal that does not have a centre of symmetry in a num- 
ber of its elements of symmetry (see Sec. 262) may exhibit the 
piezoelectric effect. This phenomenon manifests itself in a change 
of the dimensions of a crystal under the action of an electric 
field and, conversely, in the creation of an electric field in a crys- 
tal under the action of forces applied to the crystal. When utilis- 
ing the piezoelectric effect as a source of vibrations, we are deal- 
ing, of course, with the former aspect of the phenomenon, known 
also as electrostriction or the inverse piezoelectric effect. Piezoelec- 
tric materials include quartz crystals, Rochelle salt, barium tita- 
nate, and dihydrophosphate of ammonia. Generally speaking, there 
are hundreds of known materials that could, in principle, be used 
for this purpose. However, additional requirements, e.g., durabil- 
ity and stability with respect to moisture as well as the natural 
desire to select crystals that will yield the strongest effect, sharply 
limit the practical choice of material. 

The change in crystal dimensions under the action of an electric 
field differs for different directions (with respect to the crystal’s 
axes of symmetry). Therefore, different deformations will be ob- 
tained when, from a crystal, we cut rods or plates having different 
orientations with respect to the crystal’s axes and place them be- 
tween condenser plates. Usually, the quartz plate or other piezo- 
electric material is cut in such a manner that longitudinal dis- 
placements occur in the material when it is placed in an electric 
field. Thus, under the action of an alternating electric field, the 
forced vibrations produce standing longitudinal waves. 

If ¢ is the thickness of the plate in the direction of wave mo- 
tion, then, as usual, the natural frequency of vibration is given 
by the formula ven. For a quartz crystal having this simple 
orientation, the velocity of the elastic wave is equal to 5,400 m/sec. 
Hence, the fundamental natural frequency of vibration of a quartz 
plate is determined from the formula 

yan kc/s (/ is in centimetres). 


It should be noted that the measured value is somewhat different, 
namely, 2,880// kc/s. 

The vibration amplitudes depend on the magnitude of the applied 
field, whereby a linear dependence exists between the displacement 
magnitude and the electric field intensity. It is not uncommon to 
use large field intensities. Since quartz is an excellent insulator, 
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electric fields of the order of 30,000 volts/em find application 
for thicknesses up to a centimetre. 

To attain a powerful ultrasonic signal, use is made of the res- 
onance effect. This is essential because resonance displacements 
are thousands of times greater than displacements under the action 
of static fields, and furthermore, the vibration energy is pro- 
portional to the square of the displacement. 

By gradually increasing the frequency of the generator, one can 
successively excite all the overtones of the crystal. The frequency 
range of commercial ultrasonic generators extends from hundreds 
to thousands of kilocycles. 


CHAPTER VII | 
ACOUSTICS 


46. The Objective and Subjective Nature of Sound 


Man can perceive the loudness, pitch and timbre of sound by 
means of his hearing organs. The electronic oscilloscope enables us 
to investigate the objective and subjective nature of sound in detail. 

Since sound is the result of a vibratory process taking place in 
air, it may be completely described by a curve showing amplitude 
change (immaterial whether displacement, vibrational velocity or 
pressure) with respect to time. Such a curve enables us to establish 
whether the process is periodic and, if so, to determine the funda- 
mental tone of the vibration. By studying the periodicity, the 
overtones present and their amplitudes may be determined. In other 
words, the curve showing the dependence of the vibration on time 
always enables us to find the spectrum of the vibration, i.e., to 
establish which frequencies are present and the amplitudes of these 
frequencies in the spectrum. The curve is obtained by means of 
a microphone connected to an oscilloscope. In more elaborate 
arrangements, the curve of the vibration is automatically converted 
into its spectrum. 

A simplified diagram of such an analyser is shown in Fig. 68. 
The input sound is converted by a microphone into electrical 





Fig. 68 


current, is amplified and applied to an apparatus consisting of a 
large number of filters, each of which passes a specific band of 


frequencies, e.g., : of an octave (36-48, 48-60, 60-72 cps, etc.). 


Sec. 46] The Objective and Subjective Nature of Sound 141 


The filters resolve the signal intoits spectrum; the narrower the frequency 
band of each filter the greater its resolving power. Each portion of 
the spectrum passing through a filter is fed via a commutator to an 
amplifier and detector (rectifier). The output of the detector is 
applied to the oscilloscope plates that deflect the electron beam 
in the vertical direction. lf a voltage is not applied to the second 
pair of oscilloscope plates, then, upon connecting each of the 
filters, the vertical deflection of the electron beam will be propor- 
tional to the amplitude of the corresponding frequency component 
of the spectrum. However, the arrangement is greatly improved 
by connecting a voltage to the second pair of plates that provides 
horizontal sweep of the electron beam. This is done by properly 
synchronising the rotation of commutator 2, which provides the 
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sweep voltage, with the automatic rotation of commutator ft. Thus, 
the amplitude of the component passed by each filter is displayed 
for a different, but unique, horizontal displacement of the electron 
beam. As a result, the complete spectrum is displayed on the 
oscilloscope screen. 

Periodic vibrations have discrete spectra, while nonperiodic 
vibrations have continuous spectra. Musical sounds are illustrative 
of the former, while various kinds of noise illustrate the latter. 

One and the same musical tone played on different instruments 
will have the same fundamental frequency, but will have different 
spectra. The quality of the sound is determined by the distribution 
of the overtone intensities (see Fig. 69). In the musical sense, the 
more complex the spectrum the richer the quality of the 
sound. It is interesting that the phase of the overtones (see 
the formula on p. 104) does not affect the subjective perception of 
sound. The ear can only distinguish between the intensities of the 
overtones. 

Noise analysis is of great practical importance. If the noise 
frequencies of largest intensity are known, the cause of the noise 
is more easily determined and, consequently, more easily eliminated. . 


47. Intensity and Loudness of Sound 


In Fig. 70, the heavy lines mark the limits of the region of auditory 
perception for the average person. Two uniquely related quantities 
are plotted along the ordinate, namely, the amplitude of sound pressure 
and the intensity of sound. The sound pressure p and the sound 
intensity / are related, in the simplest case, by the formula 
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We know that the intensity of the wave is 
. [=we, 
where w is the energy density, i.e., w=, But u= a (see p. 116). 


Hence, substituting, we obtain the above formula. The intensity of 
sound may be measured in watts/cm’. 

Very intense sounds produced by a pressure of about 2,000 bars 
cause a sensation of pain. Very weak sounds can still be perceived 
by the average person when they have a pressure of 2x 10~* bar 
(1 bar= 1 dyne/cm’). Since for air ec = 41, we obtain for the limits of 
sound intensity the values 0.5x10° ergs/secem? (=0.5x 107? 
watts/em’) and 0.5x 107** watts/cm’. 

This large range of intensities makes it convenient to introduce 
a logarithmic scale. If the intensity of one sound is /, and the 
intensity of another /,, we say that I, is K decibels louder than 
I, when 


K =10log2. 
1 


The quantity K is called the loudness level. Thus, if the sound 
intensities differ by a factor of a million, they differ in loudness 
by 60 decibels. 

When expressing the sound intensity in decibels, it is necessary 
to indicate the zero level. The value usually selected for this lev- 
el is close to the threshold of audibility (107** watts/cm’). A whis- 
per, then, has a loudness of about 15 db and the noise of an air- 
plane about 120 db. 

Returning to the diagram of auditory perception, we see that the 
region of speech is quite restricted in frequency (100 to 10,000 cps) 
as well as in intensity (40 to 80 db). Sounds of different frequency 
have different audibility. The human ear perceives frequencies of 
several thousand cycles per second best of all. Below 20 cps lies 
the infrasonic region and above 10,000-20,000 cps the ultrasonic. 

The table gives approximate values for the sound pressure p, the 
intensity J and the loudness K. 





| Pp I K 

(bars) (watts/cm?)| (db) 
Threshold of audibility.........008. 2.91074 107%8 0 
Drip of thawing snow. .........06. 2.9 107 107 20 
Low conversation at a distance of 5metres . . .| 2.9% 107? 107%? 40 
Symphonic orchestra(fortissimo) ....... 2.9 1078 80 


Airplane engine at a distance of metres . . . 290 10-4 120 
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48. Architecture and Acoustics 


In some auditoriums speech is unintelligible even though suffi- 
ciently loud, while in others the speaker must raise his voice in 
order to be heard. Let us investigate the physical parameters of an 
auditorium that determine its acoustic properties. 

Experiments show that the most important factor of this nature 
is the so-called reverberation time, the time in which a sound de- 
creases to one-millionth of its original intensity. With respect to 
acoustics, an auditorium is best when its reverberation time v is 
0.5-1.5 sec. If t is less than 3 sec, the auditorium is considered 
to be good. If the reverberation time exceeds 5 sec, the acoustics 
are very bad, being characterised by “resounding”. 

A sound uttered at some part of a large hall is reflected from 

the walls, floor and ceiling, the furniture and drapes, and from the 
clothes of those present. If for each reflection the sound loses a 
large part of its energy, the sound will be attenuated very rapidly. 
The reverberation time in this case is very small and the sound 
will be “dull”. Resounding occurs when the sound is repeatedly re- 
flected with little attenuation. The listener will perceive the direct 
wave, the wave after one reflection, two reflections, etc. If the in- 
terval of time between the arrival of these sound waves does not 
exceed 1/15 of a second, the ear will not perceive two or three 
distinct sounds as in the case of echoes, but rather a prolonged, 
and hence unclear, sound. 
- It is evident that the time attenuation of sound is determined 
by its absorption in the surrounding bodies. Since the sound is re- 
peatedly reflected, after a short time of constant sounding from 
some source the auditorium will more or less uniformly fill up with 
sonic, i.e., vibratory, energy. Within a short period of time, equi- 
librium is established between the energy delivered by the source 
and the energy absorbed by the medium. It should be noted, inci- 
dentally, that in the absence of absorption the sonic energy in a 
closed room would increase without limit for continuous sounding 
of a source. 

If the sound source is interrupted, then the phenomenon reduces 
to the absorption of the sonic energy by the surface of the bodies 
located in the room. Each of the materials involved in this proc- 
ess has its own characteristic coefficient of absorption a. If there 
is an open window in the room, the absorption coefficient may be 
assumed to be equal to 1, since the sound: completely leaves the 
room, and this is equivalent to being absorbed. For a smooth, sol- 
id wall the coefficient a is almost zero (for concrete, it is 0.015). 
Now, the sound absorption for the entire room-may be described 
by the expression A=a@,S,-+a,S, +a,S,-+..., where the sum takes 
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into account all the surfaces in the room. Theory shows that 
the reverberation time depends on the quantity A and the volume 


of the room V, i.e., t= 0.16. In this formula, the volume is 


expressed in cubic metres and the quantity A in square metres. 

By means of these formulas, it is not difficult to calculate the 
reverberation time. The absorption coefficient for concrete is given 
above; for glass, wood and plaster, it is not much larger (up to 
3 per cent). A sharp increase in absorption occurs when soft mate- 
tials are brought into the room. Suffice it to note that the clothing 
of one person absorbs as much sound as 20 square metres of wall 
surface. For soft materials, the coefficient of absorption varies be- 
tween 0.5 and 0.9. A large role in the solution of acoustical prob- 
lems in the construction industry is played by porous materials 
(e.g., spun glass and porous concrete), whose coefficients of absorp- 
tion approach the values of a for soft materials. 


49. The Atmosphere and Acoustics 


When a wave passes from one medium into another, it changes 
its direction of propagation in accordance with the law of refrac- 
tion. The angle by which the direction of propagation changes is 
determined by the index of refraction, i.e., the ratio of the ve- 
locities of propagation. 

It was indicated in Sec. 82 that the velocity of sound propaga- 
tion is sensitive to changes in temperature. A temperature increase 
of 1°C increases the sound velocity by about 0.5 m/sec. The tem- 
perature of different layers of the Earth’s atmosphere has, as a rule, 
different values. Thus, in different layers of the atmosphere, sound 
will have different velocities. How 
is the propagation of sound affected 
by the fact that the sound travels 
in a medium in which the refrac- 
tive index is continuously changing? 

Let us answer this question by 
referring to Fig. 71. Assume that 
the sound passes through a series 
oi layers: and. that ‘in: each- layer ——-=—-=—-——— => -—-=—— 
the refractive index is constant but Fig. 71 
changes abruptly from layer to 
layer. The path of the sound wave is represented by the broken 
line. If the thicknesses of the layers are small and the differ- 
ences in the refractive indexes begin to decrease, the broken line will 
approximate a curved line. Thus, in a medium of variable index 
of refraction, sound waves propagate, generally speaking, along 
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curved lines. Moreover, the path is always such that the wave trav- 
els from point to point in the shortest time. This proposition is 
known as Fermat’s principle. A straight line, in this case, is in 
a certain sense not the shortest. 

We shall demonstrate the validity of this principle for the case 
of two adjacent segments of the broken line just considered. Let 
us assume, for simplicity, that the thickness d is the same for both 
layers and that the propagation velocities v, and v, are different. 
The time required for a wave to traverse the path indicated in 
the figure is equal to 


t= = Vere oo V (a—x)* +a’, 
Here, the time is expressed in terms of the independent variable x. 


For different values of x, the refraction will differ and so will the 
time of travel from the initial point to the final point. The least 


time will be taken when the condition 5! =0 is satisfied, i.e., when 
0, x a—x 


M  Vedd 


Vet Via—xvpe 


—*. is the sine of the incident afigle and +. 
V era V (a—xy?+d# 
the sine of the refraction angle. This proves that the refraction of 
the wave occurs in such a manner that its time of travel is a min- 
imum. It should be emphasised that this result is valid not only 
for elastic waves but for all undulatory processes. 

Thus, a wave travelling in a nonhomogeneous medium changes 
its direction in such a manner that its path is lengthened in a me- 
dium in which the propagation velocity is larger and shortened in 
a medium in which the propagation velocity is smaller. In other 
words, a wave in a layer where the propagation velocity is large 
will tend to travel parallel to the layer, while in a layer where 
the propagation velocity is small it will tend to travel perpendic- 
ular to the layer. 

This is clearly illustrated in Fig. 72. Here, the path of a sound 
wave is schematically shown for the case when the temperature 
of the air decreases with height (usual day-time condition) and for 
the case when the temperature increases with height (night-time 
condition). 

In the former case, the velocity of sound propagation is large in 
the layers close to the Earth. If we trace the propagation of the 
sound wave emanating from a point above the Earth’s surface at 
a small angle with the vertical, the following situation is seen to 
prevail. Each of the successive layers deflects the wave further and 
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further away from the vertical. When the angle of incidence be- 
comes equal to the angle i,, for which <=, 
and total reflection occurs. Formally, the reason for total 
reflection is clear, namely, sini cannot become larger than 
unity. The physical basis of this interesting phenomenon will be 


refraction ceases 


a) Day- time 





Source of sound 
5) Night-time 


Source of sound 





considered below (Sec. 128) in connection with electromagnetic 
waves. In any case, instead of being propagated along the Earth’s 
surface, the wave is turned in an upward direction. The diagram 





makes clear how “zones of silence” are formed. At night, the path 
of a sound wave is turned convex upwards. As a result, audibility 
at night is much better than during the day. When sound’ propa- 
gates over a reflecting surface (a still body of water), it can be 
heard for several kilometres even when its intensity is relatively 
low. The path of such a wave is represented by a series of con- 
secutive convex arcs (Fig. 73). 
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50. Ultrasonics 


The vibratory energy in a unit volume of a sonic field is proportional 
to the square of the frequency. Thus, the density of the vibratory 


energy is wae, but the amplitude of the velocity is u,=Aw, so 


that w is proportional to w*. A powerful ultrasonic source is capa- 
ble of producing vibrations with a pressure amplitude of dozens 
of atmospheres. This means that in small volumes of matter, we 
go through the following cycle several thousand times per second: 
up to dozens of atmospheres of compression, down to zero, then up 
to dozens of atmospheres of expansion, ete. 

It is evident that a powerful mechanical action of this nature 
may have a number of specific effects. One such effect is cavitation. 
At the instants of vibration corresponding to maximum expansion 
in a liquid located in an ultrasonic field, microscopic explosions 
occur and dissociated gases and steam rush into this region. At the 
instants of vibration corresponding to compression, tremendous pres- 
sures of the order of thousands of atmospheres are produced in the 
regions of these explosions. 

This powerful force may be used to overcome the forces acting 
between molecules. Emulsions such as fat in water and benzene in 
water become dispersed under ultrasonic action. Sooner or later cav- 
itational explosions occur in the suspended particles. This disin- 
tegrating action has found wide application in industry. 

However, ultrasonic action may be of considerable importance 
even when cavitation does not occur. Thus, if an ultrasonic wave is 
passed through aeorsol (a suspension of solid particles in a gas, e.g., 
smoke), the particles are precipitated out. The vibrations cause tne 
solid particles to gather at the sound pressure nodes, where the 
particles merge and become sufficiently heavy to fall to the ground. 

Finding blowholes, internal cracks and other defects in metals 
by means of ultrasonic irradiation is another important field of 
application. The method is based on the reflection of such a wave 
from the boundary between the medium and air, i.e., between the 
metal and the inclusion. Only if the dimensions of the defect are 
greater than a wavelength will the method work. In order to de- 
tect a defect having a dimension of 1 mm, the wavelength should 
be less than 0.1 mm, i.e., a frequency of the order of 10° cps. The 
frequencies used are usually much lower (10’ cps), the method be- 
ing employed to detect large flaws. 

As is well known, ultrasonics also finds application in echo 
sounding and underwater location. 


CHAPTERIX 
TEMPERATURE AND HEAT 


51. Heat Equilibrium 


When all the properties of a body remain unchanged, we say 
that the state of the body has not changed. On the other hand, 
when some property of the body changes, its state changes. The 
state of a body may be changed by doing work on it. How- 
ever, the same resulis may be achieved without using mechan- 
ical means. Water is heated by intensely stirring it or by plac- 
ing it on a gas burner. Heat exchange is said to take place when 
the external medium or surrounding bodies act on a body or sys- 
tem of bodies under consideration so as to change the state of 
this body or system of bodies by nonmechanical means. 

If there is no heat exchange between the bodies, the bodies are 
in thermal equilibrium and have ihe same temperature. The presence 
of thermal equilibrium can be directiy verified by bringing the 
bodies into contact with each other: whereupon the states of the 
bodies after contact should not differ from the states before con- 
tact. However, heat exchange is also possible when bodies are far 
apart. Thermal equilibrium may be detected, in this case, by means 
of a third body acting as a thermometer. If the thermometer is 
in equilibrium with both bodies, the temperature of these bodies 
is the same. This means that they would also be in a state of 
thermal equilibrium when in direct contact. By means of a “third 
body”, a thermometer, it can always be ascertained whether bodies 
have equal or different temperatures. 

By means of a thermometer, we can establish not only whether 
thermal equilibrium prevails or not, but also the extent of a partic- 
ular deviation from equilibrium. To obtain a suitable thermome- 
ter, it is first necessary to agree on the type of thermometer (mer- 
cury, alcohol, water or gas) and the property (indication) by which 
we shall judge whether thermal equilibrium has been achieved be- 
tween object and thermometer. As always in physics, it is important 
to agree on what instruments, in this case thermometers, wil) be 
considered primary. A thermometer can, then, always be calibrated 
by means of a standard. Gaseous hydrogen is the material used in 
a standard thermometer and the gas pressure p is the property by 
which the temperature is determined. The temperature of a body 
is taken as proportional to the hydrogen pressure in a gas ther- 
mometer at the constant volume occupied by the hydrogen. 
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The temperature scale is selected as follows. We call the temper- 
ature of the melting point of ice 0° and that .of the boiling 
point of water 100° (at a pressure of 760 mm of mercury). Meas- 
uring the hydrogen pressures p, and p,,, at these two points, and 
drawing a straight line through the plotted points,we obtain the 
Pp Celsius, or Centigrade, scale. 

The equation of this line, 
shown in Fig. 74, has the form 





fa, PP, y'100; 
Peco Proo— Po 





The straight line intersects 

278TC 5 the ¢-axis at a temperature of 
ue : — 273.1°C. This is absolute 

Fig. 74 zero. By definition, lower tem- 


peratures are not possible. In 
physics, we often use a temperature calculated on the basis of 
absolute zero, namely, 7=t-+273.1°. This is called the absolute 
temperature or the temperature in degrees Kelvin (°K). 

When we calibrate working thermometers with respect to a hy- 
drogen standard, only a limited interval of temperatures may be 
used. At high temperatures, diffusion of the hydrogen through the 
walls of the vessel may begin to occur. At low temperatures, the 
hydrogen may liquefy. Nevertheless, the adopted method of deter- 
mining temperature has complete general validity as will be shown 
below (p. 159). 


52. Internal Energy 


The basic characteristics of bodies in the presence of mechanical 
and heat interaction is very well depicted by the so-called kinetic 
molecular model. A body consisting of molecules is considered as a 
system of moving and interacting particles subject to the laws of 
mechanics. Such a system of molecules has an energy consisting of 
the potential energy of the interacting particles and their kinetic ener- 
gy of motion. This energy is called the internal energy of the body. 

A specific internal energy corresponds to a specific state of the 
body. Changes in the mutual disposition or character of particle 
motion are related to changes in internal energy. Irrespective of the 
means employed to increase the internal energy of a body, the sur- 
rounding bodies must transfer energy to the molecules of the body 
under consideration. If the body is subjected to mechanical action, 
the energy transfer occurs in a regular manner. In the case of heat 
exchange, the energy is transferred through chance impulses trans- 
mitted now to one, now to another molecule. 
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The quantity of energy transferred to a body by mechanical 
means is measured by the amount of work done on the body. The 
quantity of energy transferred through heat exchange is measured 
by the quantity of heat. 

Since the exact calculation of the internal energy of a body is very 
difficult and in most cases impossible, and since the very conception 
of internal energy as a purely mechanical quantity is only a rough 
one, a clear determination of this quantity is necessary. This can 
be done by considering processes occurring without heat exchange 
with the surroundings, i.e., so-called adiabatic processes. Adiabatic 
conditions can be provided by using a thermally insulated contain- 
er for the experiment and taking the measurements during short 
intervals of time (so that the heat does not have time to “escape” 
from the volume under study). Numerous experiments leading to 
the establishment of the law of conservation of energy show that, 
irrespective of the means employed to change the state of a body 
in such a process, the amount of work required is exactly the 
same in each case. The magnitude of this work, A, is equal by 
definition to U, the increment of the internal energy of the body: 


A220. 


Naturally, the absolute value of the internal energy cannot be 
determined from the experiment. 

If the mechanical model of a body were a completely faithful 
representation, the above expression would be a simple consequence 
of the law of conservation of mechanical energy. However, the 
kinetic molecular model is only a model and, therefore, the fact 
that there is a specific energy corresponding to each state of a 
body, so that the diflerence in energy between two states is equal 
to the adiabatic work of transition, represents an extremely impor- 
tant law of nature leading to the law of conservation of energy. 

Heat exchange and mechanical action can lead in a number of 
cases to the same change of state, i.e., to the same change in the 
internal energy of the body. This enables us to equate heat and 
work by measuring the quantity of heat in the same units as 
work and energy. 


To obtain an idea of the magnitudes of various internal energies, let us cige 
some figures. 7 

When the temperature of 1 gm of water is raised by 1°, the energy in- 
treases by 


1 calorie = 0.427 kg-m= 4.18 & 10? ergs = 2.61 & 10” ev. 
In this case, the average increase in energy of a molecule is 
3 1078 calorie = 1.28 1077? kg-m = 1.25 & 107" erg = 7.83 & 1074 ev. 
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The internal energy given up by matter in the combustion of 1 gm of coal 
amounts to 


7,000 calories = 2,990 kg-m = 2,93 & 10" ergs — 18.3 10” ev. 
Calculating this on the basis of one atom of carbon, this figure reduces to 
1.4 107? calorie == 5.98 & 1072 kg-m=5.86 & 107" erg ==3.66 ev. 
The energy released in the nuclear fission of 1 gm of uranium-235 is 
2.03 X 10! calories = 8.65 10° kg-m = 8.49 10!” ergs = 5.29 & 10®° ev. 


Calculating this on the basis of one atomic nucleus, the internal energy given 
up amounts to 
7.9 X 10712 calorie= 3.38 & 10°! kg-m=3.3 & 1074 erg= 
= 206 10° ev = 200 Mev, 


which is more than 59 million times greater than the energy of chemical reac- 
tions. 


53. The First Law of Thermodynamics 


In the most general case, when energy is exchanged with the 
medium or with surrounding bodies, the system under considera- 
tion may receive or give up a quantity of heat Q and perform or 
have performed on it a given quantity of work A. Heat and work 
are the two forms in which the energy of a body may be trans- 
mitted to the medium or, conversely, the energy of the medium 
may be transmitted to the body. The law of conservation of energy 
excludes the possibility of any loss in the energy exchange. The 
difference in the energies of the system for the two states must 
equal the sum of the heat and work obtained by the system from 
the surrounding bodies. 

This proposition could not be subjected to experimental verifica- 
tion if we did not add that the incremental energy due to the 
transition from one state to another is always the same irrespective 
of the character or method of transition from the initial to the 
final state. It is precisely this provision that embodies the law of 
conservation of energy. Now, it can clearly be subjected to all-sided 
experimental verification by measuring the heat and work imparted 
to the system for various transitions from a particular initial state 
to a particular final state. The incremental energy in all cases will 
be identical. 

The law of conservation of energy expressed in the above con- 
crete form is called the first law of thermodynamics. This very 
important law of nature was established as the result of the work 
of a number of scientists in the middle of the last century. The 
roles played by Robert Mayer, Joule and particularly Helmholtz 
rank especially high on the list. As physics developed, the relation- 
ship of the first law of thermodynamics to the more restricted law 
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of conservation of mechanical energy and the more general law 
of conservation and transformation of energy became ever clearer. 
During the early development of physics, there was a tendency 
to equate the law of conservation of mechanical energy with 
the first law of thermodynamics. The growth of science has 
shown ihat such a simplified view is erroneous, that many 
‘forms of energy do not reduce to mechanical energy. It was noted 
by Engels that the law of conservation and transformation of 
energy, which indicates that any form of motion may be trans- 
formed into any other, subject to the preservation of certain quanti- 
tative relationships, is the most general law of existence of matter. 
The first law of thermodynamics is a concrete expression of this 
law. 

In order to write the first law of thermodynamics in the form 
of a formula, we must first agree on the choice of sign for heat 
and work. Let us assume that heat is positive when it is imparted 
to the system and work is positive when a body performs it against 
the action of external forces. The first law of thermodynamics 
may then be written in the form 


AQ = dU +-AA, 


i.e., the heat applied to a body goes to change the internal energy 
and perform the work of the body. Naturally, each of the quanti- 
ties entering into the equation may be positive or negative depend- 
ing on the particular transformation being considered. 

It is not accidental that in writing the above equation the 
differential sign was used only for the energy. Work and heat are 
not total differentials. When a body goes from one state to another, 
the work and heat received or given up by the body depend on 
the “path” traversed, and only the energy increment, as in the 
case of the total differential of a function, does not depend on the 
manner of transition: 


(au=u,—v,, 
1 


The law of conservation of energy and, in particular, the first 
law of thermodynamics apply in all branches of physics. Science 
is enabled to make particularly valuable predictions on the basis 
of this law. Without knowing anything about the nature of a par- 
ticular process except the initial and final states of the system 
under consideration, a number of important conclusions may be 
drawn. For example, assume that molecules A and B are united by 
a chemical reaction to form the molecule AB. Assume, further, 
that U,, Ug and U,,, the internal energies of the molecules, are 


154 Temperature and Heat [Chap. IX 





known. If Uy, is greater than U,+U,, we are able to predict 
that the reaction will proceed with the absorption of heat and the 
quantity of heat will be equal to Q=U,,—(U,+U,). Or, if we 
know U, and U,, and measure the heat of the reaction by means 
of a calorimeter, we can determine U,,. These data can then be 
used to predict the course of some other reaction involving the 
compound AB. 


54. The Internal Energy of Microscopic Systems 


Naturally, the law of conservation of energy and the principles 
of energy exchange are valid for large bodies as well as for the 
individual particles of a body. However, when considering particles 
(nuclei, atoms and molecules), or systems consisting of a small 
number of particles, one other important law of nature must be 
taken into consideration, namely, the energy of a microscopic system 
cannot assume any arbitrary value. Each system has a sequence 
of possible values of internal energy, U,, U,, ..., that is charac- 
teristic of it alone. Figure 214 (p. 485) shows the possible energy 
levels for a hydrogen atom. Similar diagrams may be drawn for 
the energy levels of other atomic systems. When imparting heat or 
work to a system, the energy of the atoms, molecules or other 
microscopic systems may increase only by specific, discrete amounts 
(quanta). In exactly the same manner, energy is given up to sur- 
rounding bodies in quanta. 

Strictly speaking, the law of the quantum character of energy 
and the existence of a “scale” of possible energy levels for each 
microscopic system is a perfectly general law of nature that is 
valid for large bodies as well. However, as is shown in theoretical 
physics, the number of energy levels in a large body consisting 
of n atoms is, roughly speaking, n times the number of energy 
levels in a single atom. 

As the energy increases, the intervals between levels become smaller 
and smaller (see the diagram for hydrogen). The reduction in the interval 
between energy levels is incomparably more rapid for a large body 
than for an individual atom and only the very lowest levels appear dis- 
crete. The higher levels merge and it appears as though a large 
body can change its energy continuously. If energy is taken away 
from a body, it “descends” to a lower level. Hence, the lower the 
temperature of a body, i.e., the closer it is to absolute zero, the 
sharper the quantum character of the energy changes. , 

Mechanical action serves to shift the energy levels of a body or 
system, but in the overwhelming majority of cases this displace- 
ment cannot be observed. For microscopic systems—atoms and 
molecules—the effect of pressure is very small. 
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Thermal interaction consists in the transitions of a system from 
one energy level to another. 

Thermal equilibrium is mobile equilibrium. A body does not have 
a single energy all the time, but is rather continuously exchanging 
energy with its surroundings, so that on the average the energy 
remains unchanged. The exchange of energy occurs in discrete 
amounts or quanta. If at one instant the energy is equal to U,, 
the next instant it has changed abruptly to U,. 

Energy is given up in the form of radiation. If U,>U,, then 
U,—U,=hv, where v is the radiation frequency and A is Planck’s 
constant. This constant is equal to 6.62 107*’ erg sec. Energy may 
be gained by absorbing radiation or as the result of a mechanical 
impulse from some particle. 

If the temperature of a body drops instead of remaining constant, 
this signifies that the number of transitions from higher to lower 
levels exceeds the number of transitions in the reverse direction. 
The energy decreases in jumps and the body emits one quantum 
of radiation after another. 

The diagrammatic representation of energy exchange was origi- 
nally carried out only for atoms. Somewhat later, it became evident 
that this representation has general validity. We refer the reader 
to Part III of this book for further details on this subject. 


55. The Equation of State 


Three basic properties or parameters of state may be selected from 
{he various properties of a body. These are the pressure p, the 
volume v and the temperature 7. Knowing these parameters is not 
always sufficient to exhaustively describe a body. If a system con- 
sists of various substances, we must also know their concentrations. 
If a body is located in an electric or magnetic field, we must know 
the intensity of the field. However, it is always possible to select 
a group of parameters that will uniquely determine the state of a 
body. The other characteristics may then be calculated from the 
basic parameters. 

Leaving electromagnetic fields out of consideration and restrict- 
ing ourselves to simple systems—gases, liquids and isotropic solid 
bodies—it turns out that only two parameters determine the state 
of a body. It is immaterial which pair of parameters are seleeted 
from p, v and JT. Usually, v and 7 are selected. The pressure 
p is then a function of uv and 7. We call the equation 


p=fv, T) 


the equation of state. It is of very great importance in physics to be 
able to determine this equation for a body and, in particular, 
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for a class of bodies. Equations of state may be established only 
experimentally. The nature of the dependence of pressure on vol- 
ume and temperature for liquids and solid bodies varies tremen- 
dously from case to case. By establishing the equation of state 
for a given body, we are able to determine its behaviour under 
a variety of conditions, but this does not in any way enable us 
to determine the behaviour of other bodies. 

Quite often, the behaviour of a substance is not described by an 
equation of state, but rather by the derivatives of certain of the 
parameters with respect to the others. 

To establish how a body expands with increasing temperature, 


at constant pressure, we must determine oe (the derivative ofv 
with respect to T at constant pressure). The quantity 


1 /dv 
(5) 
is called the thermometric coefficient of dilation. As can be seen 
from the formula, @ gives the relative change in the volume of a 


body for a 1° change in temperature. 
The thermometric coefficient of change of pressure 


B= 5 (ar), 


gives the relative change in the pressure for a 1° change in temper- 
ature (at constant volume). The dimensionality of the coefficients a 
and B is given in reciprocal .degrees. 

The third useful quantity is the compressibility 


pi ($ 
“0 \dp/ 7’ 


which gives the relative decrease in volume for a unit increase in 
pressure (at constant temperature). 

These three coefficients are connected by a very interesting rela- 
tionship, which is easily derived as follows: Since 


p=f(v, T), 
then 


_ ( 9P op 
do— (ar), 47 +(35). 4° 
If the pressure is constant, then dp=0O and 


(ar ), (ar), (3e)e= —" 


Sec. 55] The Equation of State 157 





Hence 

Bx ol 

eye 
This result shows that if we know, for example, the compressibility 
and the thermometric coefficient of change of pressure, we can 
calculate the value of the thermometric coefficient of dilation. The 
derived relationship is valid for all bodies. 

Generally speaking, the coefficients a, B and x are not constant 
quantities for a given substance. For various pressures and temper- 
atures, these coefficients may assume various values. Hence, when 
the value of a coefficient is given, it is necessary to indicate the 
corresponding values of pressure and temperature. In some cases, 
the average value of the coefficients is given over a particular in- 
terval of temperatures or pressures. 


Here are some examples: 
a) The table gives the thermometric coefficient of dilation a and the compres- 
sibility x for some liquids. 





— 


| a (deg -1) | x (atm!) 





Water, 10°-30°C, normal press ........ 2.07 x 107* ee 

° 1.81 x 107 -95 x 107 
Mercury, 10°-80°C 2... 2... eee eae - | 16.56%1074 14631078 
Ether, OC ........ Wen Ge ea! Sess Aan we ods 


For solid bodies, the thermometric coefficient of dilation and the compressi- 
bility may vary considerably. Thus, at normal temperature and pressure, for 
fused quartz, a=1,.29 x 107° deg? and »=2.7x107°, while for ebonite, 
a=77X107® deg™! and x=18x107% 

b) The values giveninthe table for B, the thermometric coefficient of change of 
pressure, are calculated for water, mercury and ether at atmospheric pressure 


1 | 


a 








This means that when the temperature of a constant volume of mercury is 
increased by 107* degree, its pressure increases by 6 per cent (!). 
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56. The Equation of the Gas State 


The simplest equation of state is that of a rarefied gas. It was 
obtained by Mendeleyev in the form of a single formula that com- 
bines Clapeyron’s equation and Avogadro’s law. Clapeyron’s equation 


states that & is a constant for a given mass of gas: 


p? — const. 
T 


Avogadro’s law states that gram molecules of different gases at 
constant temperature and pressure occupy equal volumes (22.41 
litres at a temperature of O°C and a pressure of 1 atmosphere *). 
Hence, for one gram mole, the constant in Clapeyron’s equation 
is a universal constant. It is designated by the letter R and is 
called the universal gas constant. For a mole of any gas, the equa- 
tion assumes the form 


puv= RT. 
Here v is the volume of a mole of gas. The constant R has the 


dimensions of work per mole degree. Expressed in various units, 
its value is 








— 7 ergs joules att 
R=8.31x 19 mole deg _ 8.31 mole deg 
atm litres calories 
= 0.0821 mole deg ~~ mole deg ° 


Since the volume of an arbitrary mass of gas is Vv, where 
wis the number of moles, the equation of state for a rarefied gas 
assumes the following form in the most general case: 


pV =pRT or pV = 4 RT. 


Here m is the mass and M is the molecular weight. 
This equation yields the following convenient formula for the 
gas density @: 
_, Mp 
OC RT: 


Gases obeying the equation of the gas state are called ideal 
gases. The simplicity of the equation would in itself be sufficient: 
grounds for using this term. However, as we shall see below (p.190), 
this equation may be derived by assuming the gas to be represented 


* A physical atmosphere is meant here (1 physical atmosphere = 1.033 engi- 
neering atmospheres). 
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in an ideal system. An ideal gas is then a system of molecules 
whose dimensions and forces of attraction may be neglected. 
For ideal gases, the coefficients of dilation, change of pressure 
and compressibility are given by the following simple formulas: 
1 1 
a=Pp=7, aan 
At a temperature of 0° C (T = 273.13°) 
1 ~1 = 
a= B = 973.13 deg = 0.00366 deg tS 


The following data show to what extent certain actual substances 
approach the ideal condition: 


a at V=const 


Hydrogen ....... 3,660 x 1078 
Helium ........ 3,660 x 107° 
Nitrogen... .. eee 3,674 107° 
Carbon dioxide. .... 3,726 x 107° 
7h a ee em 3,674x 107° 


Gaseous substances under a pressure that considerably exceeds 
atmospheric cease to obey the formulas of an ideal gas. Calculations 
may lead to errors of several per cent at pressures of only tens 
of atmospheres. 

An important conclusion to be drawn from the study of rarefied 
gases is that, generally speaking, any of these gases—and not only 
hydrogen—may be used as a basis for determining temperature. 
Hydrogen provides no particular advantage over other rarefied 
gases. It can, therefore, be said that the temperature scale adopted 
in physics is not a hydrogen scale, but rather a scale of pressures 
for an ideal gas. Herein lies the advantage of the method adopted 
for determining temperature, namely, the existence of a large class 
of substances leading to temperature scales that are identical. The 
kinetic molecular basis for the selection of the temperature scale 
will be given below (p. 190). 


57. The Equations of State of Actual Gases 


The equation of the gas state begins to yield very rough results 
for gases at high pressures, steam close to saturation and a number 
of other cases. Other equations of state are, therefore, required for 
such cases. Some are determined experimentally, while others (the 
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most well known being Van der Waals’ equation) have, qualita- 
tively, a theoretical basis. In any case, the validity of one or another 
equation can be established only through comparison of the results 
calculated by means of the equation with the results obtained 
experimentally. We shall now give some examples of equations 
of state. 

Naturally, the simplest correction that can be introduced into 
the equation for ideal gases takes account of the volume of the 
gas molecules. It is evident that a gas cannot be compressed to 
zero volume even if the pressure is infinitely large. Hence, the 
equation of state may be written in the form 


p(vu—b)=RT, 


where 5 is a constant that takes account of the finite volume of 
the molecules. 

The greater the number of constants introduced into the equation 
of state the easier it is to achieve close agreement between experi- 
mental and calculated values, but the more difficult it is to predict 
changes by means of the formula. Excellent agreement with experi- 
ments over a broad interval of values for the parameters of state 
is obtained by means of the formula proposed by Beatie and Bridge- 
man. It contains five constants—A, B, a, b and c—descriptive 
of the substance: 

Ti , A’ 
p=) (Wo +B gee 
where 
Oo & a Mae 6 \ pa He. 
A ~A(1—<), Bis (1-4), es 


oT" 


Dieterichi’s formula contains three constants—a, b and s: 
p(v—b)=RTe #787, 
Van der Waals’ equation contains two constants—a and b: 


(p-+e) (0-8) =RT. 


The merit of the last equation is that it correctly reflects the gener- 
al character of the dependence between the parameters for all 
gaseous substances. However, for a given substance, it is not pos- 
sible to select constant values for a and 6 in such a manner that 
the calculations agree closely with measurements over a_ broad 
interval. 

Van der Waals’ equation is based on the following: The pressure 
satisfies the equation of the gas state, i.e., pat, when the 
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forces of attraction between the molecules are neglected. But due 
to the mutual attraction of the molecules, the pressure on the walls 


of a vessel should decrease by some value p’. Thus, = Atay’. 
Now, taking the finite volume of the molecules into consideration, 
RT 





p=s3—P’ or (p+p’) (v—b)=RT. 


a : 
Why does p’=-,? Here, we reason as follows: Let us consid- 


er the gas volume divided into two parts. One part, then, attracts 
the other. The forces of attraction are proportional io the number 
of molecules in the left-hand part and to the number of molecules 
in the right-hand part. In other words, the forces of attraction are 
proportional to the square of the density, i.e., inversely propor- 
tional to the square of the volume. 

The forces of attraction between molecules will be discussed in 
more detail in Part ILI. 


CHAPTER X 
THERMODYNAMIC PROCESSES 


58. Graphical Representation 


If two parameters of state are given for a body, the third may 
be calculated by means of the equation of state. Thus, when one 
parameter (e.g., pressure) is plotted along one axis and a second 
parameter (e.g., volume) is plotted along the other axis, the state 
of the body is uniquely described by a plotted point. 

To be sure, it has been assumed in the graphical representation 
that the state of the body is in equilibrium. Only then will the 
values of the parameters of state be the same throughout the volume 
of the system and are we justified in speaking of the temperature, 
pressure, density, etc., of the body (or system) as a whole. 

The question arises: What kind of process can be involved if 
equilibrium states are being considered? The answer consists in the 
following: In a process that proceeds sufficiently slowly, the values 
of the parameters of state throughout the volume are equal. Such 
a process may be considered to be a continuous succession of equi- 
librium states. It is reversible and may occur in either direction. 
A process consisting of a succession of equilibriums may proceed 
from state / to state 2 and then from state 2 back through all the 
intermediate states to state 7, without Producing any changes in 
the surrounding medium. 

A reversible process is an idealised process. Every actual process 
is in one way or another irreversible, depending on how far away 
the intermediate states of the process are from equilibrium. 

This becomes clear from the following reasoning. Every establish- 
ment of equilibrium is irreversible. Many simple and familiar 
examples can be cited: the cooling of a body by placing it in 
a cooler surrounding, the “dissipation” of a mechanical deformation, 
e.g., the restoration of a compressed spring to its undeformed 
position upon being released, the spontaneous intermixing of two 
gases, etc. Reversible processes cannot proceed of themselves. They 
cannot be single processes occurring in a closed system. 

An actual process does not consist of a succession of equilibriums. 
Inevitably, such phenomena occur as those enumerated above. 
Hence, when the process is made to proceed in the reverse direction, 
it will never pass through exactly the same states as during the 
forward direction. When a gas is rapidly compressed, the pressure 
of the gas in layers close to the piston will be higher than elsewhere. 
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On the other hand, during the reverse process—expansion of the 
gas—the pressure near the piston will be lower than elsewhere. 

Nevertheless, in spite of the fact that reversible processes are an 
idealisation, they are of great interest since in many cases the differ- 
ence between actual and reversible processes is insignificant. It all 
depends on the relaxation time, i.e., the time it takes for equilibrium 
to be established. This time varies within very broad limits— 
from ~ 107*° sec, the time it takes for the pressure to equalise in 
a homogeneous gas, to minutes, hours, or even weeks, for processes 
involving heterogeneous substances. 

Let us assume that a gas is compressed and the entire process 
takes one second. The relaxation time is an insignificant fraction 
of a second. We may therefore consider the actual process as a 
succession of equilibrium states and 
draw the curve on a graph showing 
p and v, or on some similar dia- 
gram. The same holds true for all 
other processes in which the relaxa- 
tion time is small with respect to the 
duration of the process. 

Fig. 75 shows several curves repre- 
senting simple processes. The coordi- 
nates of the graph give the pressure 
and the volume. In engineering ther- 
modynamics, other coordinates are 
used in addition to these, but we need Fig. 75 
not discuss them. The vertical line 1 
in the figure represents a process at constant volume. If the point 
generating the curve is moving upwards, the pressure is increas- 
ing; if the reverse is true the pressure is falling. It is clear that 
a change in temperature occurs during this process that is not “seen” 
onthe diagram. The horizontal line 2 represents a process at con- 
stant pressure (isobaric process). Moving from left to right signifies 
expansion, while the reverse corresponds to compression. The curve 
designated by the figure 3 corresponds to an expansion accompanied 
by a drop in pressure, while curve 4 represents an expansion in 
spite of the increasing pressure. The change of temperature in 
any process may be calculated by means of the equation of state. 

In most thermodynamic processes, all the parameters of state are 
changing simultaneously. Nevertheless, a number of simple but at 
the same time practically important exceptions may be singled out. 
These include the processes mentioned above, namely, at constant 
volume (isochoric) and at constant pressure (isobaric), as well as 
the processes occurring without heat exchange (adiabatic) and at 
constant temperature (isothermal). 
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59. Work and Cycles 


In mechanics, work is usually represented as the product of a 
force and a distance. In thermodynamics, we are usually interested 
in the work of changing the volume of a body. Fig. 76 shows the 
shape of a body in two states. The 
volume is shown to have changed 
from v, to v,. The total work of 
changing the volume may be consid- 
ered as the summation of the work 
expended in displacing the elements 
of area dS by the distance dl. If the 
applied forces are perpendicular to 
the surface, the work of displacing 
a surface element is equal to fdi or, 
in terms of pressure, pdSdl. Thus, 


dA= pdo, 





where dv is an element of volume change. It is evident that the 
total work is given by the definite integral: 
V2 
A=\ pdo. 


U1 


In a pressure-volume diagram, the work of compression or expan- 
sion can be represented geometrically. It is simply the area under 
the curve (bounded on the left and the right by vertical lines through 
the points representing the initial and final values of the volume). 

If the pressure during the process of compression or expansion 
remains unchanged and if, moreover, it is the same at all points 
on the surface, then p may be brought out from under the integral 
sign and the formula for work becomes 


A=p (v,—v,). 


As we have already stated, work may be considered positive or 
negative depending on the convention adopted. We have assumed 
that work is positive when a body does work on the surrounding 
medium, i.e., work of expansion. Accordingly, work of compression 
is negative. 

If a body is transferred from state / to state 2 as a result of 
some process, and then transferred back to its original state via the 
same path, the total work of such a process is naturally equal to 
zero. Here, the work of expansion, performed on external bodies, 
is equal to the work of compression, performed by external bodies 
on the system under consideration. However, the situation is completely 
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different when the “forward” and “return” paths differ. Processes in 
which a body is returned to its original state via a different path 
are called cyclical processes. Fig. 77 shows two such cycles and the 
arrows indicate the direction of the processes. One process proceeds 
in a clockwise direction and the other in a counterclockwise direc- 
tion. A process going from left to right signifies expansion. Thus, 
in the clockwise cycle the work of expansion is greater than the 





Fig. 77 


work of compression. In this case, work is performed on the 
surrounding medium. It is evident that in the counterclockwise 
cycle a certain amount of work is performed on the system under 
consideration. In either case, the work performed during a cycle is 
represented by the enclosed area (hatched in the figure). 


60. Processes Involving a Change of Gas State 


Let us consider the relations for the four simplest processes involv- 
ing a change of gas state, whereby, in the main, we shall restrict 
ourselves to gases obeying the equation of the gas state. It will be 
presently seen that knowing the equation of state for a substance 
and applying the first law of thermodynamics, a number of valuable 
conclusions may be drawn regarding the behaviour of the body under 
various conditions. The first law of thermodynamics for gases will 
be used in the form 


AQ = dU + pdv. 


The Isochoric Process. At constant volume, the first law of thermo- 
dynamics assumes the form 


AQ=dU. 


Heat exchange occurs between the system under consideration and 
the external medium, but no work is performed on the external 
medium or the system under consideration. Two possibilities exist: 
either 1) the body absorbs heat from the surrounding medium and 
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its internal energy increases or 2) the body gives up heat to the 
medium and its internal energy decreases. 

The quantity of heat required to increase the temperature of a 
body by one degree at constant volume is called the thermal capacity 
at constant volume and is designated by the letter c with subscript v: 


<= (ar) 
0” \ dT /v=const * 


If the dependence of the internal energy of the gas on the temper- 
ature is known, the thermal capacity c, may be calculated. 

At high temperatures, there is a linear dependence between the 
internal energy of a gas and the temperature, since the thermal 
capacity in this case does not depend on the temperature. 

We are unable here to prove an important theorem. It follows, 
however, from the general laws of thermodynamics that if the 
dependence between p and 7 is linear, then c, cannot depend on 
the volume. Since such a linear dependence exists for gases obeying 
the equation of the gas state and Van der Waals’ equation, then c, 
does not depend on v for gases and the phrase “at v=const” may 
be omitted in the above formula. Thus, 


C= wu (for gases). 


If the dependence of c, on temperature is only slight, the internal 
energy of a gas may be represented by the formula 


U=c,T +const. 
For ideal gases, the constant does not depend on the volume and 
may be dropped. For a gas obeying Van der Waals’ equation, the 
constant equals —< . Thus, 
U=c,T for an ideal gas 
and 
U=c,T—= for a gas obeying Van der Waals’ equation. 


We see that, in the case of an ideal gas, a change in the volume 
of the gas when the temperature is maintained constant does not 
involve a change in energy (see p. 174). If the molecules are drawn 


together with a force per unit area of p’= o: then upon expansion 


of the gas the energy increases by the amount of work done against 
this force, i.e., by 


p'dv= — < -+const. 
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The Isobaric Process. In this process, all three terms in the equation 
for the first law of thermodynamics are different from zero. The 
system exchanges heat and work with its surroundings without a 
change of pressure occurring in the system. This process usually 
involves absorption of heat by a body from its surroundings; however 
not all the heat serves to raise the internal energy of the body and, 
in part, it is returned to the surroundings in the form of mecha- 
nical work. We shall not consider other cases. 

It is perfectly evident that the thermal capacity in this process 
will differ from that in the isochoric process considered above. In an 
isobaric process, the heat is used not only for raising the temperature. 
Hence, c, (thermal capacity at constant pressure) must be greater 
than c,. The difference may, in some cases, be calculated. 

Let us divide both sides of the equation for the first law of ther- 
modynamics by an incremental temperature: 

AQ dU dv 
Car art Par’ 


This expression for the thermal capacity is valid for any process, 
including the isobaric process under consideration. For gases, this 
formula may be rewritten as follows: 


do 
(p= CotP ar - 


For an ideal gas, the result obtained is very simple. Since pu=pRT, 


then a —& and Cp=CytpR. Thus, the difference between the ther- 


mal capacities at constant pressure and at constant volume is equal to 
the number of moles of gas multiplied by the universal gas constant. 
Then, for molar thermal capacities, 


C,—Cy=R. 
Since R=2cal/mole deg, 


Cp—Cy = 2cal/mole deg. 

The Isothermal Process. In order to avoid confusion, it should first 
be emphasised that constant temperature in no way signifiessthat 
no heat exchange occurs between the system and the surroundings. 
A system may absorb heat from the surroundings but not use it to 
raise the temperature. Thus, as is well known, the internal energy 
of a body may increase at constant temperature (e.g., melting ice). 
Moreover, for gas processes, there is another possibility (more important 
than the first): A system may return part of the heat received from 
the external surroundings in the form of mechanical work. 
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In the case of actual gases, both ways of expending the heat are 
entirely possible in an isothermal process. The heat transferred to 
a gas causes the gas to expand without the temperature being 
raised, whereupon 1) work is performed on the external surroundings 
and 2) the potential energy of the interacting molecules is increased. 

In the case of an ideal gas, whose internal energy depends only 
on the temperature and therefore cannot change in an isothermal 
process, the first law of thermodynamics assumes a particularly simple 
form. Since dU = 0, then AQ = AA. Hence, either the system expands, 
absorbing heat from an external source and performing work on 
some object, or, conversely, the system contracts, releasing heat and 
obtaining energy in the form of mechanical work from external 
bodies. An ideal gas transforms energy in an isothermal process. It 
obtains energy from the surroundings in one form and returns all 
of it to the surroundings, but in another form. 

It is not difficult in the case of an ideal gas to go over from 
the differential form AQ=pdv to the integral form. The work (we 
can just as well say heat since work and heat are equivalent) of 
an isothermal expansion from volume v, to volume vy, is 


eaten 


Substituting for pressure from the equation of the gas state, and 
bringing the temperature out from under the integral sign since it 
is a constant, we obtain 
U2 
> du U, 
A=pRT | SF =RT In. 

It should be noted that the work of equal numbers of isothermal 
expansions at different temperatures differ, being greater the higher 
the temperature. Thus, doubling the volume of a mole of some ideal 
gas at a temperature of 300° K (room temperature) requires 8.31 x 
x 300 x In 2 = 1,730 joules of work, while at a temperature of 3,000° K 
the work required is ten times as much, i.e., 17,300 joules. 

An actual isothermal process may be difficult to achieve. In 
any case, in order for the process to beeven approximately iso- 
thermal, the walls of the vessel through which the substance comes 
in contact with the surroundings must be perfectly heat-conducting. 
Moreover, the process must proceed very slowly, so that the heat 
(or work) is able to return to the surroundings in the form of 
work (or heat) instead of accumulating in the system. 

The Adiabatic Process. Adiabatic compression and expansion oc- 
cur in the absence of heat exchange with the surroundings. This 
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may be achieved by providing conditions that are in a sense the 
reverse of those for an isothermal process, i.e., perfect thermal in- 
sulation must be provided and the process must proceed very rap- 
idly, so that heat is not able to escape from the system or be 
transferred to the system. In the case of compression, in accord- 
ance with the first law of thermodynamics, which is now writ- 
ten in the form 
pdv= — du, 


{he mechanical work is converted into internal energy of the body. 
In the case of expansion, on the other hand, the work is performed 
at the expense of a decrease in the internal energy of the system 
under consideration. 

For the three processes considered above, the changes occurring 
in the pressure, volume and temperature were quite apparent, and 
for gases followed directly from the equation of state. In an adiabat- 
ic process, the nature of the change in the parameters of state is 
not apparent, since all three parameters of state.change. The si- 
multaneous solution of two equations—the equations of the gas 
state and the first law of thermodynamics—enables us to deter- 
mine the relationships. Since only the principle involved interests 
us, we shall restrict ourselves to an ideal gas in order to simplify 
the mathematical calculations. Using the expression for the thermal 


capacity of a gas at constant volume, Coy and replacing the 
pressure p by ae we obtain: —#R oo Assume that in the 


initial state the gas parameters are v,, p, T, and in the final 
state v,, p,, T,. Integrating the last equation from the initial to 
the final point of the adiabatic process, we obtain 

_uR ae BR iy 
= ss 


Si int=In 
Cy J U uv 
1 


Ts 
ie : is 


1 


Recalling that c,—c,=wR and introducing the designation <? = “7 
v 


Ty_( 24 
Ty MGs zs 


It is seen from this equation that for adiabatic compression the 
temperature increases and for adiabatic expansion falls. Various 
examples can be cited. Thus, a gas is rapidly expanded when we 
desire to cool it and carbon dioxide gas escaping from a gas tank 
may turn into dry ice due to the tremendous drop in temperature 
of the expanding gas. On the other hand, adiabatic compression 


we obtain 
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may be used, for example, to ignite some substance. The following 
experiment is often demonstrated: A wad of cotton wool soaked 
in ether is placed in a vessel containing air. Rapid compression of 
the air by means of a plunger causes the cotton wool to burst into 
flame. 

Since we wish to represent the gas processes on a pressure-volume 
diagram, it is mecessary to convert the above equation of the 
adiabatic process into an appropriate form. Substituting for the 
temperature by means of the equation of the gas state, we obtain 


Y 
PiU = p,vh. 


Comparing this equation with the Boyle-Mariotte law for an isother- 
mal process, important differences may be observed in the nature 
of the pressure change for compression or expansion. For an isother- 
mal expansion or compression, the product pu remains unchanged, 
while for an adiabatic process the product pot remains unchanged. 
Since y>1, the adiabatic on the diagram is steeper than the iso- 
thermal. When the volume is reduced to one-half in an isothermal 
process the pressure doubles, while in an adiabatic process the 
increase is greater. For example, in the case of most diatomic 
gases, for which y=1.4, when the volume is reduced to one-half, 
the pressure increases to 2.63 times its original value. 

It has already been emphasised that both processes are of an 
ideal character and that the requirements for the creation of the 
ideal conditions of these processes are opposite. Therefore, it is 
evident that gas processes under actual conditions yield interme- 
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adiabatic | compres- 
Sion, the gas becomes 


Fig. 78 


heated, so that for one 
and the same reduc- 
tion in volume the in- 


crease in pressure is 

greater in the adiabatic process, since heating at constant volume 
leads to an increase in temperature. 

Fig. 78 shows that the work of isothermal expansion is greater 

than the work of adiabatic expansion. On the other hand, the work 
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er 





of isothermal compression is less than the work of adiabatic compres- 
sion. We are assuming, naturally, that the initial points of the proc- 
esses coincide. 

The work of an adiabatic process may be determined graphically 
or by means of formulas. From the first law of thermodynamics, it 
follows that in adiabatic processes the work must equal the change 
in internal energy: 


2 
A=\ pdo=U,—U,. 


1 


In the case of ideal gases, the difference in energy is calculated simply 
as U,—U, =c,(T,;—T,). Thus, for ideal gases, the work may be 
calculated by means of this formula. 

Another method may be used to determine the work in an adiabatic 
process. Since the equation p,v,'= pu’ holds for any intermediate 
point of the process, where the symbols without subscripts desig- 
nate the current values of pressure and volume respectively, the work 
integral may be written in the form 


V2 


d 
A=p,vt \=. 


Mm. 


Upon integrating from the initial to the final point of the process, 
we obtain 
Py / 1 1 
an P45 (Jaga). 


Naturally, this formula is equivalent to A =c, (T,—T,.), which 
is easily demonstrated by using the equation of state of an ideal gas 





1 
and converting the above formula (taking wet out of the 
brackets) to the form 
__ BRT, v, \t7! 
Gres aes I: 


Depending on the given data, one formula may be more conveniently 
used than the other. ; 


Let us illustrate by a simple numerical example the statement made in 
Chapter IX to the effect that the increments AQ and AA (note: AQ ~AA) are 
not total differentials, i.e., they do not characterise the change of state of a 
system. 
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Assume state (1) of a mole of hydrogen (Fig. 79) is characterised by the 
following data: vi=20 litres, T1=300°K and p= "5 — 30 calflitre (here R= 
=2 cal/deg). Now, Cp —y=R and, since hydrogen is a diatomic gas, 
“21.4, Hence c,=7 cal/deg and c,—5 cal/deg. 


” We shall now consider three possible paths by means of which the gas can 
change to state (3), where v,=40 litres, 7,=300°K and p,=15 cal/litre: 
1 2 Path 1-8. The work along the isotherm 


p A,_s=RT, In=+=416 cal. These 416 calo- 


ries are absorbed from the hot body, and 
the internal energy U=const since Ti=T,. 

Path 1-2-3. Here, (1-2) is an isobar. Hence, 
T2=600°K. The heat absorbed from the 


hot body is Q,_2=¢,(T,—T ,)=2, 100 cal and 
the work against “the external forces is 
3 A,_,-=p,(0,— v,}=600 cal. Therefore, the 


4 internal energy of the gas increases by 
\ AU=2,100—600=1,500 cal. Process 2-3 
constitutes isochoric cooling and Q,_,= 
i Y =c,(T,—T,)=1,500 calories of heat are 
uy Up=U3 transmitted to the cold body. Since v,=v,, 
: no mechanical work is performed. 

Fig. 79 Thus, along the path 1-2-3, the hot 
body gave up 2,100 cal, 600 cal of work 
was performed and the cold body absorbed 1,500 cal. Along the path 1-2, 
the hot body gave up 416 cal, 416 cal of work was performed and no change 
in the state of the cold body occurred. However, the change in the state of the 

gas was the same for both cases. ; 
Path 1-4-3. Here (1-4) represents an adiabatic process, while (4-3) 

1 1 

t 


1 
, So that v,=20X2 - litres. From the 


1 
rn ‘i U, Pa af =2 
is an isobar. Now, a Gr = 


1 
M% 7 tee ‘ cane 
relation —-=(——- , we find that T,=300x2 * Along the path 


UU 
1-4, the work against the external forces is performed only at the expense of 


1 
— 1 
a decrease in the internal energy: A,_,=— cy (7, —7,)=1,500(1 27 cal. 
Along the path 4-3, the hot body gives up Q,_3= c, (T;—T)= 
1 


—-—1 
=2,100 (1—2 7 cal of heat and the work against the external forces equals 
: q 


—_—=1 
Ag_s=P (Us—0,) =600(1—27 — )ecal, 
Therefore, along the path 4-3, the internal energy increases just by 
1 


——1 
1,500(1—2 % ) cal. Thus, the path 7-4-3 has also not led to a change in 
the internal energy of the gas, which is uniquely determined by the temperature. 


Measuring the Thermal Capacity of Gases. It would appear that 
the easiest way to determine the thermal capacity of a gas is to fill 
a container with the gas to be measured and immerse it in a calori- 
meter. However, this does not take into account the fact that the 
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thermal capacity of a gas is very small with respect to the thermal 
capacity of a container, no matter what solid material is used to 
make the container. Therefore, the thermal capacity of a gas is not 
measured at constant volume, but rather at constant pressure. For 
this purpose, gas under constant pressure moves in a coiled pipe 
that passes through the calorimeter. By means of a thermocouple, 
the temperature of the gas is measured at the input and output of 
the calorimeter. After preliminary heating, the gas enters the calo- 
rimeter and transfers part of its heat to the water. Knowing the 
quantity of gas passing through the container during a particular 
period of time, and the quantity of heat transferred to the water of 
the calorimeter during the same interval of time, the thermal capac- 
ity of the gas at constant pressure, c,, can be easily determined. 
This is done by dividing the quantity of heat by the mass of flowing 
gas times the difference in gas temperature between input and output. 

To determine the thermal capacity at constant volume, we use 
the ratio of thermal capacities, i.e., Poisson’s coefficient: yeZ. 
Many methods of determining y have been proposed, some of them 
being based on the measurement of the volume and pressure of the 
gas in a succession of states for an adiabatic process. Other rela- 
tionships between the thermal capacities may also be used, e.g., the 
relationship defining the difference between the thermal capaci- 
ties c, and c,. 

The thermal capacities of various gases are given in the table. 

















ey cp 
Gas cal/deg mole | cal/deg mole 7 
Helium He ......... 2.98 5.00 1.67 
Hydrogen Hp ........ 4.87 6.87 1.41 
Nitrogen N, 2... 2... 4. 4.86 6.84 1.41 
Oxygen OQ, ......-.-. 4.99 6 90 1.40 
Water vapour H,O...... 6.65 8.65 1.31 
Methane CH, ........ 6.51 8.51 1.30 
Ethyl alcohol C.H;OH ... . 18.9 20.9 1.11 





61. The Joule-Thomson Process 


This is the process in which gas is allowed to flow through a small 
opening from a region of high pressure p, into a region of low pres- 
sure Pp. The vessel in which the process takes place is thermally in- 
sulated from the surroundings. 

In accordance with the conditions of the process, p, and p, must 
not change. This is done by having both pistons (Fig. 80) move tothe 
right,corresponding to the passage of gas into the region of low pressure. 
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M, the mass of gas that is moved from left to right, does not main- 
tain constant volume, but changes from v, to v2, for it has entered 
a region of different pressure. This transition is accomplished under 
the action of the left piston and 
the counteraction of the right 
one. The left piston does work 
at the constant pressure p. This 
work is equal to p Av, where Av 
is the change in the volume of 
the gas to the left of the parti- 
tion. But v, is the change in vol- 
4 ume on the left, so that the work 
done by the left piston is p,v,. 
The right piston does negative 

Fig. 80 work, which in this case is equal 

to the product of the pressure py 

and the incremental volume v,. Thus, when the mass of gas M 

is transferred from the left region to the right one, the work per- 

formed is py0,—p¥,. The law of conservation of energy requires 

that the internal energy of the gas change by this same amount. 
Therefore, 





: U,—U,=p,v,— P,v,. 
This formula is valid for any mass of gas. This means that in the 
process of moving a gas from one vessel into another the quantity 


U-+ pu = const 


(called the heat function or enthalpy) does not change. 

For an ideal gas, U and pv both depend only on the temperature. 
Thus, during a Joule-Thomson process, the temperature of an ideal 
gas does not change. 

For actual gases, the situation is different. If the gas is not ideal, 
the temperature may increase or decrease during a Joule-Thomson 
process, depending on the nature of the interacting forces between 
the molecules. 

It is noteworthy that a particular gas may behave differently at 
different temperatures. At a high temperature, the temperature in- 
creases during a Joule-Thomson process, while at a low tempera- 
ture, it decreases. The point of inversion corresponds to the temper- 
ature at which the change in sign occurs. This temperature for 
oxygen and nitrogen is above room temperature. Therefore, the 
air is cooled during a Joule-Thomson process conducted at room 
temperature—not to speak of lower temperatures. For hy- 
drogen, the inversion temperature is very low. The Joule-Thomson 
effect below the inversion temperature finds application industri- 
ally in the liquefaction of gases, 


CHAPTER XI 
ENTROPY 


62. The Principle of Entropy Existence 


In the middle of the last century, an important discovery was made 
regarding reversible thermodynamic processes. It was found that 
side by side with internal energy a body has yet another remarkable 
function of state, namely, entropy. Just as in the case of internal 
energy, entropy includes an arbitrary constant. Experiments yield 
the incremental difference in entropy. If a body or system absorbs 
the heat AQ during an infinitesimally small transition from one 
state to another at a temperature T, the ratio ca is a total differ- 
ential of some function S. This function is the entropy and is thus 
determined by the two following equivalent equations: 


ds= 2 and S,—S,={ 2. 


The statement that a function exists whose differential is “8 
is known as the principle of entropy existence. It is one of the most im- 
portant laws of nature and an essential part of the second law of ther- 
modynamics, which will be discussed below. The discovery of this 
principle, as well as the entire second law of thermodynamics, is 
primarily associated with the names of Carnot and Clausius. In 
spite of its somewhat abstract nature, the essence of the principle 
is easily understood and may be summarised as follows: A body may 
change from one state to another in an infinite number of ways (rep- 
resented on a diagram by the various curves connecting the same 
initial and final points); and, although the body may absorb var- 


ious amounts of heat during such transitions, the integral [ “2 
1 


will in all cases have the same value. =¢ , the ratio of the quan- 
tity of heat to the temperature at which this heat was absorbed, is 
sometimes called the reduced heat, Since an integral may always 
be approximately represented as a summation, the change of entropy 
in transferring from one state to another is equal to the summation 
of the reduced heats. Let us assume that the body absorbs a calorie 
per degree as it is uniformly heated from 20° C to 25° C.The increase 
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in entropy is then 


I cal lcal I cal 1 cal I cal 
S,—S, ™ 293.5° + 294,5° we 295.5° + 296.5° Be 297,5° * 
For isothermal processes, the expression for the change in entropy 
is very simple: 
S,-S,=4, 


2 


where Q is the heat absorbed during the process. Thus, when 1 kg of 


ice melts the entropy of the substance increases by ee == 0,29 a : 


In applying the concept of entropy, the value of entropy at any 
state (e.g., boiling water or melting ice) may be adopted as zero 
entropy. However, in some cases, the value of the entropy of a substance 
at absolute zero is adopted as zero entropy. There is, incidentally, 
some theoretical basis for this (Nernst’s theorem), but we shall not 
ge into it. 

Assuming S =0 at T =0, the entropy of a substance at the 
temperature T may be determined by the formula 


T 
c, dT 
s=\ : 
0 





T 


if the heating occurs at constant pressure. Ascan be seen, the depend- 
ence of the thermal capacity on the temperature must be known 
in order to determine the entropy. 

The entropy may be easily calculated (except for the arbitrary 
constant) if the equation of state of the substance is known. By defi- 


nition, dS = 52. Substituting the value for AQ obtained from the 
equation for the first law of thermodynamics, we obtain 


aT d 
dS = Cy + PF. 


By means of the equation of the gas state, we can eliminate the pres- 
sure from this equation, obtaining: dS=c, +R 2 . Taking the 
indefinite integral, we obtain an expression for the entropy that 


includes an arbitrary constant: 
S=c,InT+uR Inv+const. 


It is also possible to take the definite integral of dS, where the 
‘limits are two states. We then obtain the following expression for 
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the entropy difference between the two states: 
T v 
S,—S,= Cy In -p +-BR In=. 


This is the expression for the entropy of ideal gases. It is seen from 
the formula that the entropy increases as the temperature and the 
volume of the gas increase. Naturally, this is in agreement with 
the general statement that the entropy increases when heat is trans- 
ferred to a body. 


Example: Using the example on p. 171 (Fig. 79), we shall show that the en- 
tropy is indeed a function of the state of a system: 
Path 1-2-3. The change of entropy 


S,—S,= Cy In + Rin = 5 In2$2 In2Q=7 Ind ney 





U1 mole deg* 
1 
The change of entropy S,—S,=5 In —=—5 2 a aes 
The total change of entropy along path 1-2-3 is S,; —S,=2 nese 


Pek te Sao ina 


mole dg" 
Pcth 1-4-3, Since (1-4) is adiabatic, S,—S,=0, S,—S,=cy In 2+ 
cal 
mole deg’ 


It is seen, indeed, that no matter how the transition of the gas from state (1) 
to state (3) is effected the change of entropy is the same. 


5 
7 


U3 Pees 1 
+ 2 In %,=5.In2 7121n2 = 2!In2 


63. The Principle of Increasing Entropy 


As already stated, reversible processes, strictly speaking, do not 
exist. However, many processes occur that do not, practically, 
differ from reversible ones. But there are some processes that are 
always unidirectional and as a result can never be made reversible. 
Thus, gas may expand of itself, but it cannot be compressed without 
the application of an external force. Heat may spontaneously pass 
from a hot body to a colder one, but it can pass from a cold body to 
a hotter one only if work (e.g., electric energy) is expended. In the 
presence of friction, the kinetic energy of macroscopic motion is 
always converted into internal energy, but the reverse process never 
occurs spontaneously. All other irreversible processes are in the 
final analysis based on the fact that in each of them, to one degrte 
or another, one of the enumerated unidirectional processes occurs. 
In actual processes, it is impossible to avoid spontaneous expansion, 
friction and thermal dissipation. 

Do not all the enumerated unidirectional processes have a common 
characteristic? As a matter of fact they do: in ail unidirectional 
processes, the entropy ir.creases. 


7 — 3580 
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In the case of heat exchange between two bodies, the 
overall change in entropy of the entire system is S, —S, = = Ee: 


where Q, is the heat absorbed by the colder body and Q, is the heat 
given up by the hotter body. 
If T, is greater than 7,, thenQ,=—Q,>0, since heat transferred to 
1 
—7-)>0, 
i.e., during heat exchange, there is an increase in the overall entropy 
of the system in which the heat exchange occurs. 
Let us take another case. Assume intensive mechanical motion 
(e.g., rotation of a wheel) takes place in a vessel containing gas. 
The volume does not change, but the temperature increases and, 


hence, the entropy changes by S,—S,=¢,In 2, i.e., it also in- 
1 


creases. 
Finally, upon expansion into an evacuated veer at constant 


temperature, the increase in entropy, S,—S, =pR In— , is again 


positive. Thus, in all unidirectional processes, the entropy of the 
system increases. 

It is easily seen that this conclusion regarding all irreversible 
processes is of great importance. Since each irreversible process is 
accompanied by unidirectional effects serving to increase the entropy, 
the increase in entropy in an irreversible process is greater than the 
increase that would have occurred if the process were reversible. 
Let AQ be the heat absorbed by a body at temperature 7 in an irre- | 
versible process. If the process were reversible, the increase in en- 


a body is considered positive. Hence, S,—S,=Q, (7 


tropy would equal “2. In an actual process, however, the increase 
in entropy is greater than this value: 


dS > 52, 


If the system is thermally insulated, then AQ = 0 and the above 
expression assumes the form dS > 0, i.e., in a thermally insulated 
system, only processes serving to increase the entropy are possible. 

It is quite clear that entropy and internal energy are the most 
important functions determining a thermodynamic process. Thus, 
if entropy is analogous to the manager of a process, internal energy 
is analogous to the bookkeeper. While entropy determines the di- 
rection of flow of the process, the energy “meets the expenditures” 
of conducting it. 

If in the above formula the symbol> is used instead of thesymbol>, 
the law of entropy for reversible and irreversible processes may be 
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described by the following simple formula: dS= Ak. This formula 


expresses the essence of the second law of thermodynamics. For 
closed systems, the second law states that the entropy of a thermally 
isolated system increases or remains the same. 

Both laws of thermodynamics may be combined in the single 
formula dS = SU + pd , which is applicable to all practical thermo- 
dynamic problems. 

Examples. 1. In the example on p. 78, we considered the nonelastic collision 
of a bullet with a ballistic pendulum and showed that, upon collision, 399.6 kg-m 
of mechanical energy are dissipated in the bullet-pendulum system. This means 
that the bullet irreversibly transfers A Q=935 cal to the pendulum through 
heat conduction. If it is assumed that the process is isothermal (i.e., the ther- 
mal conductivity of the pendulum is extraordinarily high), and that the tem- 
perature is, say, 27°C, then the entropy of the system in this irreversible process 
will increase by 


AS=*2—3.12 cal/deg. 


2. A rubber ball weighing 0.3 kg rises 1 metre off the floor after being dropped 
from a height of 2 metres. In this isothermal process (assume ‘=27°C), 
we transfer AQ=0.3 kg-m irreversibly, i.e., the entropy of the bal!l-floor sys- 
tem increases by 

AS = 2.35 & 1075 cal/deg. 

If the ball and floor were absolutely elastic, the entropy would not have 
changed (AS=0) and the motion of the ball would have continued eternally. 

3. Let us consider the irreversible process involved in the transfer of heat 
from a steam boiler to a condenser. Assume that the steam boiler is at a temper- 
ature #:=300°C and the condenser at a temperature ¢2=30°C. For a boiler 
thermal capacity of 10,000 kw and an efficiency of 25 per cent, 7.510% joules 
will be transferred from the boiler to the condenser every second. Since the 
boiler loses heat, its AQ will be negative, i.e., the boiler loses entropy. For 
the condenser, on the other hand, the entropy increases. However, since T1>T2, 
the entropy of the boiler-condenser system will increase each second by 


AS=AQ (a rr) =2.81x 10° cal/deg. 


64. The Principle of Operation of a Heat Engine 


A heat engine converts heat into work. In other words, it takes 
heat from some bodies and transfers it to others in the form of me- 
chanical work. In order to accomplish this conversion, we must have 
at our disposal two bodies at different temperatures, between whith 
heat exchange is possible. The hotter body will be designated as the 
hot body and the colder one as the cold body. In the presence of two 
such bodies, the process of conversion of heat into work may be 
described as follows: A substance capable of expanding (the working 
substance) is brought into contact with the hot body. Heat Q, is 
taken from the hot body and is expended on the work of expansion, 


7* 
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A,, which is transmitted to surrounding bodies. The working sub- 
stance is then brought into contact with the cold body and transfers 
heat Q, to it at the expense of the work A, performed on the working 
substance by the external forces. 

To obtain a continuously operating heat engine, the compression 
process must be concluded where the expansion process began. In 
other words, the overall process must be cyclic. The working sub- 
stance returns to its initial state at the end of each cycle. Hence, 
the law of conservation of energy requires that the energy obtained 
from the surrounding bodies equal the energy transferred to the sur- 
rounding bodies. The working substance obtained the heat Q, dur- 
ing expansion and the work A, during compression. On the other 
hand, it gave up the work A, during expansion and the heat Q, 
during compression. Hence, Q,+ A,=Q,+ A, or Ay— A,= Q,— 
—Q,. When the cyclic process is conducted clockwise, the work of 
compression is less than the work of expansion. Therefore, the last 
equation expresses the simple fact that the net work transmitted 
to a working substance by an external medium is equal to the differ- 
ence in the heat absorbed from a hot body and given up to a cold 
body. Accordingly, the efficiency of the cycle and, hence, of the en- 
gine as a whole is 


= en 
y= 1 0. 


The described process for the operation of a heat engine is, natur- 
ally, an abstract scheme. However, the essential features of every 
heat engine are incorporated in this scheme. An expanding and con- 
iracting gas or steam is the working substance, the surrounding me- 
dium plays the role of cold body, and a steam boiler, or a fuel mixture 
in internal combustion engines, serves as the hot body. 

A refrigerating engine, in which the cycle is reversed, requires 
the same three system components. The principle of operation of 
this engine consists in the following: Expansion of the working sub- 
stance occurs when it is in contact with the cold body. Thus, 
the cold body is cooled even further, which is precisely the task of 
the refrigerating engine. Now, in order to complete the cycle, the 
working substance must be compressed and the heat given up by 
the cold body rejected. This is accomplished when the working 
substance is in contact with the hot body.Thus, the hot body becomes 
even hotter. The “unnatural” transfer of heat from a cooler body to 
a hotter body is at the “expense” of work. We see, then, that when 
the cycle is conducted counterclockwise, the relationship between the 
energy transferred to a medium and the energy absorbed from a 
medium, i.e., Q;-+ Ay = Q.+ Ay or Qo — Q, = — (A,—A,), where 
as before the subscript 1 refers to the portion of the process 
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occurring when in contact with ihe hotter body, has the following 
meaning: The quantity of heat removed from a system must be 
compensated for by an equal quantity of mechanical work. 

The second law of thermodynamics imposes certain conditions 
on the operation of a heat engine. If a process is assumed to be revers- 
ible, the change in entropy of the working substance for the entire 
cycle should equal zero. Stated otherwise, the change in entropy for 
the expansion process must equal (except for reversed sign) the 
change in entropy for the contraction process, i.e., 


aQ dQ 


fi he 
In the case of an irreversible process, the entropy of the closed sys- 


tem, consisting of the hot body, the cold body and the working 
substance, increases and, therefore, 


dQ dQ 
7+) R>0. 


(It should be recalled that Q is an algebraic quantity. Thus, 
heat entering the system is considered positive.) By evaluating 
these integrals for specific processes, it is rather simple in a num- 
ber of cases to determine the maximum efficiency of one or another 
heat engine cycle. 


65. Efficiency of a Carnot Cycle 


We shall now derive the expression for the efficiency of an ideal 
heat-engine operating without losses in a reversible cycle. 

Let us first consider the theoretical four-stroke Carnot cycle 
represented in Fig. 81. The Carnot cycle consists of two isothermals 
(for temperatures T, and T,) and two adiabatics. Assume that 
the first strokes of the cycle is 
an isothermal expansion from state 
1 to state 2— the working substance 
is in contact with the hot body 
whose temperature is 7, and the 
process takes place very slowly. When 
state 2 is reached, contact with the 
hot body is broken, the working sub- 
stance is thermally isolated and it 
has the possibility of expanding fur- 
ther. Work occurs at the expense of 
the internal energy and the tempera- 
ture of the working substance is allowed 
to drop to 7T,. From this point Fig, 81 
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(state 38), two-stroke contraction begins. The working substance 
comes in contact with the cold body at temperature T, and iso- 
thermally contracts to state 4. Here, the working substance is 
again thermally isolated and the contraction continues, now adia- 
batically, with the working substance being heated, at the expense 
of performed work, to the initial temperature T,. 

The adiabatic processes in a Carnot cycle are of an auxiliary na- 
ture, enabling us to transfer from one isothermal to another. These 
processes do not enter into the energy balance, since c,(T,—T,), 
the work of adiabatic expansion, and c,(T,—T,), the work of com- 
pression, cancel each other. 

In an adiabatic process, the entropy of a system does not change. 
During isothermal expansion, the entropy of the hot body decreases 


by - and the entropy of the cold body increases by 4- 2 The 
working substance returns to its initial state with its entree un- 


changed. If the process is reversible, then a = For irreversible 


eer the entropy of the entire system, consisting of the cold 
ody, the hot body and the working substance, increases, i.e., the 


entropy increment ois greater than the decrement oe: 
2 


Loe) 5 LQ 


Thus, #25 F and, therefore, the efficiency of a Carnot cycle is 


T. 
nar = 1—p. 


The efficiency of the cycle is determined by the temperatures 
of the cold and hot bodies, respectively. The greater the drop in 
temperature the greater the efficiency of the engine. It is not dif- 
ficult to see that the efficiency of a Carnot cycle is the maximum 
efficiency possible. There is no cycle better than the Carnot cycle 

and, in this sense, it serves as a model for 
P designers of heat engines. They strive to 
. make actual cycles approach the cycle of 

this ideal engine. 

It is not difficult to prove that the efficiency 
of a Carnot cycle is the optimum. Fig. 82 shows 
an arbitrary cycle inscribed in a Carnot cycle. The 


decrease in the entropy of the hot body may be 
represented by the pape 


( 4Q 
revecs T? 
Fig. 82 Aa 
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for which the inequality 


do= 28 


1 
> r T, 


me by 
“8 


Boo 


is undoubtedly valid, since Ti is the largest value assumed by T in the integra- 
tion. The increase in the entropy of ths cold body is expressed by the integral 


dQ 
T > 
B 
for which the inequality 
A , A 
1 
[<4 [ae 


is valid, since Tz is the smallest value assumed by T in the integration. For 
a reversible process, 














A B 
{| =|) 
T Tr |° 
A 
Therefore, é 
1Q [Q:| 
T aa 
which yields the condition: 
T: 
Nmax = ae 


Thus, the Carnot cycle has the maximum efficiency of all pos- 
sible cycles. 

This maximum efficiency formula shows why steam engines have 
low efficiency. At T,= 300° and T,= 400°, the efficiency is 25 per 
cent. Moreover, this is the maximum efficiency, attained by an 
ideal reversible engine operating without any losses in energy. It 
is, therefore, not surprising that in actual steam engines the efficien- 
cy is below 10 per cent. Courses in steam engineering discuss means 
used to increase the efficiency. Clearly, the most important method 
is to increase the temperature of the hot body, i.e., the steam or 
fuel mixture. 


66. The Second Law of Thermodynamics 


As indicated above, the second law of thermodynamics statgs 
that the entropy in a thermally isolated system increases. This sta- 
tement may appear to be somewhat abstract, but it is not the form 
in which this idea was expressed historically. In view of the tremen- 
dous importance of this law of nature, we shall briefly discuss other 
important formulations of the second law of thermodynamics and 
show that they are equivalent to the above. 
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Historically, the second law of thermodynamics was expressed 
in the form of Thomson’s postulate on the impossibility of creating 
a perpetual engine of the second kind. A perpetual engine of the 
first kind creates work “out of nothing”, i.e., its work violates the 
first law of thermodynamics. A perpetual engine of the second kind 
produces work by means of a periodically operating engine merely 
by absorbing heat from the surrounding medium. If such an engine 
were possible, it would be practically eternal, for the supply of ener- 
gy in the surrounding medium is almost limitless and the cooling, 
say, of the oceans’ waters by one degree would yield an incon- 
ceivably large amount of energy. The mass of water on the Earth 
is of the order of ~ 10'* tons. If this entire mass of water were cooled 
by only 1°, the heat released would be about 1024 kcal, which is 
equivalent to the complete combustion of 10'* tons of coal. Roll- 
ing-stock loaded with this quantity of coal would extend for a 
distance of ~ 10°° km, which is the order of magnitude of the di- 
mensions of the solar system! 

A perpetual engine of the second kind is a heat engine working 
with a hot body, but without a cold body. If such an engine were 
possible, it could work on a single stroke. A gas contained in a cyl- 
inder with a piston could indeed expand, but the operation of the 
engine would end there, since for the engine to continue operating, 
the heat absorbed by the gas must be transferred to a cold body. 
Formally, the formula for maximum efficiency shows that a per- 
petual engine of the second kind is impossible. In the absence of a 
temperature drop (T,=T7,), the maximum efficiency is equal to 
zero. 

It is impossible to design a periodically operating perpetual en- 
gine by combining an isothermal expansion with an adiabatic com- 
pression process. Such a process would not be possible even if we 
could make it reversible. For isothermal expansion of the working 
substance, the entropy decreases. Hence, the compression process 
would have to yield an increase in entropy. This, however, is not 
possible for an adiabatic process, since it proceeds at constant en- 
tropy. 

The postulate of Clausius also completely corresponds to the 
formulation adopted here for the second law of thermodynamics. 
It states that heat cannot be transferred from a colder body to a 
hotter body without compensation. A process contradicting the 
postulate of Clausius would take place with a decrease in entropy. 
At the very beginning of our discussion, this was shown to be impos- 
sible. 

We shall again return to the second law of thermodynamics in 
Sec. 77, where it will be discussed from the standpoint of the kinetic 
molecular theory. 


CHAPTER XIL 


KINETIC THEORY OF GASES 


67. General 


If the molecules of a solid body are assumed to be contiguous, we 
can accurately determine their dimensions by X-ray analysis (p. 384). 
Then by comparing these dimensions with the space available to 
a molecule in a gas, the fundamental properties of the gaseous state 
of matter may be immediately determined. 

The ae linear dimensions of a diatomic molecule of oxygen 
is about 4 A. Nitrogen molecules have approximately the same 
dimension, but molecules of hydrogen are considerably smaller. The 
volume of an oxygen molecule is about 10~** em*. Since under normal 
conditions there are 2.7 x 10° molecules in 1 cm‘ of oxygen, the space 
available to a molecule is about 0.4 10-?? cm*. Comparison of the 
volume of a molecule with the space available to it shows how little 
of the space is occupied by molecules. It is evident that for such a 
low density collisions between molecules will be relatively rare. 
On the average, the length of the path traversed by a molecule between 
consecutive collisions is 1,000 A. However, the velocity of a molecule 
is large, about 500 m/sec, so that on the average a collision occurs 
every ten-thousand millionth (10-*°) of a second. It will be shown 
below how these figures were obtained. 

Molecules begin to draw together only when the distances between 
them become comparable to their own dimensions. Therefore, for 
a large part of their path, molecules move rectilinearly and uniform- 
ly. Only when one molecule comes within range of another does 
the force of interaction become effective. Since the interaction occurs 
over an insignificantly small portion of the path, we can speak of 
a collision between the molecules. The interval of time during which 
molecules perceptibly interact—in other words, the impact time—is 
equal to about 10~** sec. Thus, a molecule spends by far the greatest 
part of its “life” in free motion subject to inertia. 

This is the situation for gases under normal conditions. An ing 
crease in pressure, leading to an increase in density, may considerably 
alter the picture. 

The internal energy of gases in which interaction between mole- 
cules occurs only for the time of instantaneous collision does not 
contain potential energy of interaction between molecules. Such 
gases are called ideal. The use of one and the same term a second time 
will be shown to be justified by demonstrating the validity of the 
equation of the gas state for such gases, 
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Thus, a gaseous substance consists of a tremendous number of 
minute particles that pass through large spaces without colliding, 
then collide like billiard balls and fly apart in different directions 
with different velocities, until the next collision. If we were to trace 
the path of a single gas molecule (naturally, this can be done only 
mentally), we would find it moving now to the left or to the right, 
now forward or backward. Sometimes it would be moving with a 
large velocity and at other times it would be moving slowly. In view 
of the chaotic nature of thermal motion in a gas, the molecules of 
a free gas in thermal equilibrium may be considered to have uniform 
density distribution throughout its volume. Furthermore, at a given 
instant, there will undoubtedly be equal quantities of molecules 
moving in all directions. Other random events will similarly be 
uniformly distributed. For example, at all locations, equal numbers 
of molecules per second of observation will be travelling without 
collision a distance of 100 A to 200 A. 

It must be realised, however, that these statements are of a statis- 
tical character, They are valid on the average, whereby the greater 
the number of gas molecules involved the greater the validity. 

We assert, for example, that the number of molecules moving 
“to the right” is the same as the number moving “to the left”. Natu- 
rally, this does not mean that the numbers are equal to within sev- 
eral units. The number of molecules involved is so large that not 
only is a difference of several units insignificant, but even a differ- 
ence of several million is negligible percentagewise. 

If numerous measurements are taken of the gas density of a given 
volume, the values obtained for the number of molecules will differ 
somewhat from measurement to measurement. From these data, we 
can determine the average value for the number of molecules in the 
volume under consideration. If it were possible to measure within 
an accuracy of ever several thousand molecules, the individual meas- 
urements would oscillate, percentagewise, to an insignificant extent 
about this average value. 

When it is stated that a number of molecules have such and such 
a velocity, or move in some direction or other, or collide in accordance 
with some mechanism or other, then the average value of the number 
is always meant. If the number of gas molecules is large, the devia- 
tions of the instantaneous values from the average, i.e., the fluctu- 
ations, are negligible. In a very rarefied gas, however, the fluctua- 
tions may be considerable. ; 

It is shown in the theory of probability that, using absolute 
values, the average relative deviation of the gas density from the 


average is approximately equal to -\_. where n is the number of mol- 


Va’ 


ecules in a unit volume. Since there are 2.7x10!® molecules in 
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1 cm® of gas, the fluctuation of the gas density within one cubic 
centimetre amounts to 
1 


V 27x10" ’ 


i.e, 210719 from the average value. It is evident that such devia- 
tions are beyond experimental observation. 

This is how matters stand with respect to all gas properties that 
are determined by the average number of molecules. 

The origin of the kinetic theory of gases dates back to Daniel 
Bernoulli (1700-1788). M. V. Lomonosov (1711-1765) also made 
substantial contributions to its development. In the 19th century, 
the kinetic theory of gases developed under Clausius (1822-1888), 
Maxwell (1831-1879) and Ludwig Boltzmann (1844-1906) and 
assumed its modern form. 


68. Mean Free Path 


The distance traversed by a molecule between two consecutive 
collisions (the range of a molecule) is, naturally, a random quantity 
that may sometimes be very small or very large for individual mole- 
cules. However, in view of the chaotic nature of the particle motion, 
the average value of this quantity for a given gas state is undoubtedly 
constant. The mean free path or, for brevity, the range / is related 
to the average velocity v of the molecular motion and the average 
time tv between two collisions by the simple relation: 1—vt*. Typ- 
ical values for these quantities were cited on p. 185 

The range of a molecule depends, in the first place, on the number 
of molecules in a unit volume of gas. Moreover, it is evident that the 
larger the dimensions of the molecule the smaller the mean free path. | 

In order to visualise the character of this relationship, let us con- 
sider a cylindrical volume of gas through which a molecule moves 
along the cylinder axis. What is the path taken by the molecule? 

Molecules are not points. They have dimensions determined by 
the distances for which molecular interaction becomes effective. 

On the basis of crystallochemical measurements (see p.589), we 
may, with considerable accuracy, ascribe acertain form to molecules. 
At distances extending beyond the limits of the molecule’s “bound- 
ary”, the forces of interaction are, practically, not effective. 

Let us project the maximum cross-section of the molecules on to 
the base of the cylinder. Each molecule will be projected differently. 


* Since we are merely concerned with the determination of the connection 
between the physical quantities and not with the determination of the exact 
formulas, we shall not differentiate between average and root-mean-square 
velocities (see below). 
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Since there are many molecules, the average cross-sectional area 
will characterise a molecule with sufficient accuracy. This average 
area of cross-section o is called the effective cross-section. 

A collision will surely occur along the length of ihe cylinder if 
the area of the cylinder base is completely filled with the cross- 
sections of the molecules. If the cylinder base is equal to 1 cm’, 
cylinder length equal to J, and the number of molecules per unit 
volume equal to a, then there will be a total of n/ molecules in the 
cylinder. The projections of the cross-sections of these molecules 
will completely cover the cylinder base when nlo=1. Under these 
conditions, the value of / will have an order of magnitude that is 


: 1 F 
close to the average range of the molecule, i.e., / = 3g + More rigorous 


calculations confirm the validity of this rough estimation. In the 
exact formula, the factor V2 enters in the denominator: 


epee 

V 2no’ 
where o has a constant magnitude for a given gas. Thus, the mean 
free path is determined only by the density. A decrease in density 
by a factor of 100, for example, results in an increase in the mean 
free path by the same factor. 

For air under normal conditions, the effective cross-section o is 
approximately equal to5 x 10~'* em?. This is in excellent agreement 
with the dimensions of oxygen and nitrogen molecules obtained from 
crystal measurements. The maximum dimension is equal to 4.3 A 
and the minimum is a little less, namely, 3 A. The radius of a circle 
having an area of 5x 107'* cm? is 4 A. 

We can determine the dimensions of molecules by studying crys- 
tals. However, the investigation of particle collisions may be viewed 
as a method of establishing the effective cross-section of particles. 
This method is of value in studying atomic nuclei (p. 551). 


The mean free path under normal conditions is: in air—600 A, in nitrogen— 
600 A, in hydrogen — 1,100 A and in helium — 1,800 A. 


69. Gas Pressure. Root-Mean-Square Velocity of Molecules 


Let us consider the problem of using the simplified concepts re- 
garding the motion and interaction of gas molecules to express the 
gas pressure in terms of the quantities characterising the molecule. 

Assume that we have a gas enclosed in a spherical tank of radius 
R and volume v. Disregarding collisions between gas molecules, we 
may adopt the following simple scheme for the motion of each mole- 
cule: A molecule moves rectilinearly and uniformly with some ve- 
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locity v, strikes the wall of the vessel and rebounds at an angle equal 
to the angle of incidence (Fig. 83). Traversing chords of equal length, 
2R sin 9, time after time, the molecule strikes the wall of the 


vessel times per second. For each impact, the momentum 


uv 
2R sin @ 
of the molecule changes by 2 mv sin 0 (see p.66). The change in 
momentum per second is equal 
to ue 


We see that the angleof incidence 
cancels out. If the molecule strikes 
the wall at an acute angle, the 
impacts will occur often, but will 
be weak. If the angle of incidence is 
close to 90°, the molecule will strike 
the wall less often, but will make 
up for it by stronger impacts. 

The change in momentum for 
each impact of the molecule on the 
wall contributes to the overall Fig. 83 
force of the gas pressure. It may be 
assumed in accordance with the fundamental law of mechanics that 
the force of the pressure is simply the change occurring in the momen- 

; 2 mv? 
tum of all the molecules in one second: ae E +... or, factoring 
out the constants, 








m 2 2 
Bp Cito + sig .). 


Assuming n molecules are contained in the gas, we may introduce 
the concept of the average of the velocity squared of a molecule, 
which is determined by the formula 


=, 1 
vu => (of top+...). 


The expression for the force of the pressure may now briefly be written 
as follows: 





mnv? 
F= 


R 


Dividing this expression by 41R2, the surface area of a sphere, we 
obtain the gas pressure: 
nmo* 
P= Tar 
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Replacing 4nR* by 3V, the following interesting formula is obtained: 


~ 2 
pV = + no? or pV=sn (75) : 

Thus, the gas pressure is proportional to the number of gas mol- 
ecules and to the average value of the kinetic energy associated with 
the translatory motion of a gas molecule. 

A very important conclusion may be drawn by comparing the ob- 
tained equation with the equation of the gas state. Comparison of 
the right-hand members of the equations shows that 

2 aig 
i.e., the average kinetic energy of molecular translation depends 
only on the absolute temperature and, moreover, is directly pro- 
portional to it. 

This conclusion shows that gases obeying the equation of the gas 
state are ideal in the sense that they approximate the ideal model 
of a group of particles whose interaction is insignificant. Further, 
it shows that the concept of absolute temperature, introduced empir- 
ically as a quantity proportional to the pressure of a rarefied gas, 
has a simple kinetic-molecular interpretation. The absolute temper- 
ature is proportional to the kinetic energy of molecular translation. 


The ratio ~—N is known as Avogadro’s number. It is the number 
of molecules in one gram molecule and is a universal constant: 
N =6.02 x 10?*. The reciprocal quantity 7 is equal to the mass of 
a hydrogen atom: 

My = 77 = 1.66 x 10-** gm, 


Another universal constant is the quantity 


k= !R_ * _ 1.38 x 10-™ erg/deg, 


which is called Boltzmann’s constant. Thus, 

me 3 

ag =a AT. 
If the velocity squared, v?, is represented by the sum of the squares 
of its components, v? =v; +vy+u:2, it is evident that the average 
energy for each component is 


RT. 


te] 
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This quantity may be described as the energy associated with one 
degree of freedom. 

The universal gas constant is accurately known from experiments 
with gases. The determination of Avogadro’s number and Boltz- 
mann’s constant, which are expressed in terms of each other, is a 
relatively difficult task involving delicate measurements. 

These results put at our disposal useful formulas for calculating 
the average molecular velocity and the number of molecules in 
a unit volume. 

Thus, for the average of the velocity squared, we obtain 


where M is the molecular weight. The square root of the average of 
the velocity squared is called the root-mean-square velocity: 


3kT 3RT 
Urms = Vy OF Urns = ye , 


i.e., the r-m-s velocity is directly proportional to the square root of 
the temperature and inversely proportional to the square root of 
the molecular weight. It is easily determined that at room temper- 
ature oxygen molecules have a velocity of 480 m/sec and hydrogen 
molecules—1,900 m/sec. At the temperature of liquid helium, these 
molecules would have, respectively, velocities of 40 m/sec and 
160 m/sec, while at the temperature of the surface of the Sun, namely 
6,000°, these velocities would be 2,160 m/sec and 8,640 m/sec, 
respectively. These examples are unrealistic, however, for, at the 
temperature of liquid helium, oxygen and hydrogen solidify and no 
translatory motion of the molecules will occur, while at the temper- 
ature of the surface of the Sun the molecules disassociate into atoms. 

We obtain the following simple expression for the number of 
molecules in a unit volume: 


Avogadro’s law follows from this and may be stated as follows: 
For equal pressures and temperatures, all gases contain the same 
number of molecules per unit volume. Thus, under normal conditions 
(at a pressure of | atm and a temperature of 0°C), 2.683 x 108 mole- 
cules (Loschmidt’s number) are contained in | cm, 
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70. Internal Energy of a Gas 


The properties of monatomic gases are determined by the kinetic 
energy of translation of the molecules. An atom’s internal energy 
does not affect the thermodynamics of the gas. Evidently, the inter- 
nal energy need be considered only when the temperature of the gas 
is very high and collisions between atoms may lead to their excita- 
tion and ionisation. These processes will be discussed in detail 
later on. 

Thus, the following formula for the internal energy of a monatomic 
gas will have very broad application: 


mv? 
U=N-, 


where N is the number of molecules. Using the formulas of the pre- 
vious article, we obtain for | mole of an ideal monatomic gas the 
expression 


3 
U= RT. 


Hence, for the thermal capacity of 1 mole of a monatomic gas, we 
obtain by means of the formulas of Sec. 60: 


3 
C=a R 
and 
5 
Cp= ah. 


The direct proportionality between the temperature and the in- 
ternal energy, and hence the constancy of the thermal capacities 
of a monatomic gas, are valid for quite a broad interval of external 
conditions. 

For polyatomic gases, such a simple picture is valid for a signif- 
icantly narrower interval of temperatures, if valid at all. The reason 
for this is that the energy of a polyatomic molecule consists of the 
energy of translation, the energy of rotation and the energy of vibra- 
tion of the molecule’s components (i.e., the molecule’s atoms) 
with respect to each other. Calculation of the average energy per 
molecule becomes quite difficult. It turns out that the energy of a 
molecule is no longer linearly dependent on the temperature and, 
hence, that the thermal capacities are no longer constants independent 
of the magnitude of the temperature. Nevertheless, it is usually pos- 
sible to find a narrow interval of temperatures in which the thermal 
capacities do not depend on the temperature. This occurs for such 
values of temperature at which the average energy of the molecule 
is not yet sufficient for the collisions of the molecule to Jead toa 
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change in its vibratory state. At the same time, this energy is suf- 
ficiently large so that the discrete (quantum) character of the energy 
of rotation is not felt. Jumping ahead and referring the reader to 
Fig. 266 (p. 601), it may be stated that linear dependence between 
energy and temperature, and constancy of thermal capacity, will 
occur when the quantity kT, descriptive of the order of magnitude 
of the translational energy of the molecule, is considerably greater 
than the distances between rotational energy levels and less than the 
distances between vibrational energy levels. 

If such an interval exists, the energy of a mole of gas and the ther- 
mal capacities of this quantity of gas are expressed by the following 
simple formulas: 


U=3RT, 
C,= 38k, 
Cy = 4R. 


The doubling of the internal energy and c, with respect to a mona- 
tomic gas may be explained in the following manner. A polyatomic 
molecule has six degrees of freedom, while a monatomic molecule has 
three. Since there are twice as many degrees of freedom, there is twice 
as much internal energy. To be sure, there is nothing self-evi- 
dent about this statement. However, we find support for this view- 
point when we consider a gas consisting of diatomic molecules, 
Since a diatomic molecule is a system consisting of two particles, 
it possesses five degrees of 
freedom (see p. 42). If the 
internal energy is indeed 
proportional to the number 
of degrees of freedom, then 
for a gas consisting of dia- 
tomic molecules the follow- 
ing formulas should be 
valid: 


5 5 
U=ZRT, Saas 


and =F 

Experiments show that in 
the temperature range in 
which the thermal capacity 
remains unchanged these formulas are quite applicable. The internal 
energy of one mole of a diatomic gas at a room temperature of 
300°K is 1,500 cal =6,250 joules. 

A typical dependence curve for thermal capacity over a broad 
interval-of temperatures is illustrated in Fig. 84. 





Fig. 84 
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71. Statistical Distribution 


Numerous events occur that cannot be predicted. These are called 
random events. The height of a young man appearing for military 
service, the number of pedestrians passing a particular crossing 
during certain hours, and the number of winning tickets in a loan 
lottery falling on each series of one hundred bond numbers are all 
examples of random events. The results obtained by observing nu- 
merous events of a single type, for example, measuring the height 
of a large number of young people, counting the number of pedestri- 
ans per minute over a large number of days, or analysing the number 
of winning tickets for a large number of loan lotteries, may be summa- 
rised in the form of a so-called distribution curve. In the case of the 
height of a person, the data may be processed in the form of numbers 
indicating the number of men called up for military service whose 
heights are between 170 cm and 171 cm, between 171 cm and 172 cm, 
etc. Thus, the probability of observing a person among those called 
up who has precisely a given height (e.g., 171.34 cm) is practically 
equal to zero. Therefore, it is better to refer only to the number of men 
called up having a height lying in a particular interval. 

In the case of the analysis of the prize list, the distribution curve 
may be constructed on the basis of the data for the number of series 
of one hundred bonds for which there was not a single winner, for 
which there was one win- 
ner, two winners, etc. 

If we construct a graph, 
plotting the random quan- 
tity (e.g., height, number 
of pedestrians or number 
of winners) along the ho- 
rizontal axis and the num- 
ber of random events (e. g., 
number of people hav- 
ing a height lying in a 
particular interval, the 
quantity of cases of a given 


Number of random events 


Random Lit number of winners per one 
: aia hundred numbers, _ etc.) 
Fig. 85 along the vertical axis, the 


obtained curve is a distri- 
bution curve. An example of such a curve is shown in Fig. 85. 
The curve is drawn through the mid-points of the tops of the rec- 
tangles. Each rectangle has an area equal to the number of times 
a random event occurred for the quantity lying in the given inter- 
val. 
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The remarkable feature of distribution curves is their reproduci- 
bility. If we construct distribution curves analysing the height of 
young men called up for military service for a number of years, it 
will be seen that the curves are entirely similar. This similarity 
will not be found if westudy the height distribution curves construct- 
ed on the basis of a small number of measurements. As we increase 
the number of measurements on which each curve is based, the curves 
for different years will become more and more similar. The same 
holds true for the distribution curves of all events, provided the events 
are random and the conditions of the obtained curves do not change. 

We call the distribution law of one or another quantity a statis- 
tical law. It is more accurately given the greater the number of events 
used to determine each ordinate of the curve. 

Naturally, knowing the distribution curve does not enable us to 
predict the number of a bond that will win in the next lottery. How- 
eVer, we can say, for example, what portion of the series consisting 
of one hundred numbers each will have one winner. The greater the 
number of bonds used in the analysis, the greater the accuracy of 
this prediction. 

In view of the large number of molecules contained in very small 
volumes of matter, all kinds of statistical predictions about the 
behaviour of molecules are made with particularly high accuracy. 
A distribution curve of one or another random quantity plotted for 
the molecules of a substance will be reproduced with tremendous 
accuracy because each “rectangle” of the distribution curve corre- 
sponds to thousands of millions of molecules. 


72. Boltzmann’s Law 


Certain ideas about the distribution of molecules follow immedi- 
ately from the chaotic nature of thermal motion. This applies to 
the velocity distribution of the molecules according to direction and 
to the volume distribution of the molecules for the case when no 
forces act on the gas. However, there are numerous cases when the 
consequences of the assumption regarding the chaotic nature of 
thermal motion are not evident in advance. 

First, there is the question of the distribution of the molecules 
according to speed. What percentage is moving rapidly, and what 
percentages are moving with average or slow speeds? Then, there 
is the problem of determining how a uniform density distribution 
of molecules changes when the gas is placed in a field of force, say 
in a gravitational field—or, if the molecules have electric or magnetic 
properties, in an electric or magnetic field. Boltzmann’s law, which 
may be derived by means of the theory of probability, gives the 
answers to these and similar questions. 
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Let us consider a small volume of space—a cube at point x, y, z 
whose sides are Ax, Ay, Az. Assume a considerable number of mole- 
cules to be contained in this cube. We shall consider those molecules 
having velocity components in the ranges from v, to v,+Av,, v, to 
v,-+-Av, and v, tov,+Av,. The magnitudes of Av,, Av, and Av, are such 
that a large number of molecules are contained in the indicated in- 
terval of velocities. This is necessary in order to be able to apply the 
laws of statistical physics to these small volumes (physically, in- 
finitesimal volumes). In the future, we shall say that such molecules 
have coordinates in the neighbourhood of x, y, z and velocities in the 
neighbourhood of v,, v,,v,. We repeat, to speak of a quantity of mole- 
cules that have exactly a given velocity is impermissible, for the prob- 
ability of encountering such a molecule is infinitely small. Since 
the kinetic energy of a molecule is determined by the value of the 
velocity and the potential energy of a molecule in an external field 
depends on the coordinates of the molecule in space, all the mole- 
cules segregated by us have, practically, one and the same energy @. 

Boltzmann’s law, based on considerations developed in courses 
on theoretical physics, gives a general expression for the number of 
molecules whose coordinates are in the neighbourhood of x, y, z and 
velocities in the neighbourhood of v,, v,, v,. This number is 


An = Ae—*'?"AxAyAzAv,Av yAv,, 


where A is a constant that may be determined for a concrete problem, 
T is the absolute temperature and & is Boltzmann’s constant. 

The energy in the exponent is equal to the sum of the kinetic 
energy of translation of the molecule and the potential energy of the 
molecule in the external field: 


2 
= = +U. Hence, 
mou? 
“ped 


An=Ae *" AxAyAzAv,Av,Ag,. 


This formula also applies to the case when the molecule possesses 
other forms of energy as well, for example, rotational and vibration- 
al. These components of the energy must then be included in @. 

Boltzmann’s law or, as it is also called, Boltzmann’s distribution, 
shows that the largest energy corresponds to the lowest number of 
particles whose velocities and coordinates lie in the given interval. 

We shall apply Boltzmann’s law to the solution of important prob- 
lems related to the height distribution of particles and the velocity 
distribution of molecules, 
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73. Distribution of Particles with Respect to Height 
in a Gravitational Field 


If, in a liquid, there are a large number of small particles that are 
heavier than the liquid and do not dissolve in it, at first glance it 
may appear that sooner or later these particles must fall to the bot- 
tom. This, however, does not occur—but it would if there were no 
thermal motion. 

Thus, the force of gravity attracts the particles downwards, but 
the chaotic thermal motion, an inherent property of all particles, 
will continuously impede the action of the gravitational force. A 
particle moving downwards may experience a collision on the way 
that hurls it back upwards. It again begins to move downwards and 
again a collision may hurl the particle upwards or sidewise. While 
some particle may succeed in reaching the bottom of the vessel, 
another particle, on the other hand, may be raised from the bottom 
by random impacts and brought to the upper layers of the liquid 
by random impulses. It is quite understandable that as a result 
some nonuniform distribution of particles is established. In the 
-upper layers there will be the least number of particles, while at 
the bottom of the vessel there will be the greatest number. The 
heavier the particles and the lower the temperature, the more will 
the height distribution of particles be “compressed toward the bot- 
tom”. 

The quantitative aspect of this interesting phenomenon, occur- 
ring for all particles located in a gravitational field (molecules of 
a gas or particles of an emulsion suspended in a gas or liquid) becomes 
clear from Boltzmann’s law. We may rewrite the exponential factor 
in the formula for the Boltzmann distribution in the form 


mv? mgh 


U, the potential energy of gravitation, has been replaced by the 
expression mgh. Now, we are interested in the number of molecules 
(of all velocities) located at a height between 4 and A-+-AA. It is 


mgh 


An=n,e * Ah, 
Here, the coefficient of proportionality n, is simply-the specific num- 
ber of particles ae at h=0. Fig. 86 shows how the number of parti- 
cles decreases with increasing height. 
The form of this relation confirms the correctness of the assertion 


made above that the greater the mass of the particles and the lower 
the temperature, the more rapidly does the curve fall. It is also 
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evident from the curve that its rate of decrease depends on the gray- 
itational acceleration. On different planets, the distribution of 
particles with respect to height will differ. 

According to the above formula, at least a small number of mole- 
cules exists at every height above the Earth’s surface. This means 
that molecules may recede from the Earth and fly into space, for it 

is not excluded that as a re- 
an sult of random collisions one 
a or another molecule will at- 
tain a velocity of 11.5 km/sec, 
which is sufficient, as we 
know, to escape from the 
Earth’s gravitational pull. It 
may, therefore, be stated that 
the Earth is gradually losing 
its atmosphere. However, cal- 
culation of the rate of dis- 
persion of the atmosphere 
shows that it is negligible. 
Fig. 86 During the entire existence of 
the Earth, an_ insignificant 
amount of air has been lost. The situation is different as regards 
the Moon, where the velocity required to overcome gravity is 
~ 2 km/sec. Such a small velocity is very easily attained by mole- 
cules and as a result the Moon has no atmosphere. 

The formula giving the number of particles as the function of 
height may be rewritten for the density of a gas or for the pressure 
of a gas. Since the gas pressure is proportional to the number of 
particles in a unit volume, the formula may be written in the form 

mgh 


P=P,é@ nes 


Here, p, is the pressure at zero level. This formula is called the 
barometric formula. It is used by meteorologists measuring atmospher- 
ic pressure at high altitudes to reduce the results of their measure- 
ments to “sea level”. 


It is necessary to note yet another important application of the formula for 
the distribution of particles with respect to height, which was used for the ex- 
perimental determination of Avogadro’s number by Perrin, the French scien- 
tist. In accordance with the conditions of the experiment, Perrin had to some- 
what modify the formula for the distribution of molecules with respect to height. 
He studied an emulsion obtained by dissolving gutta gamba (a variety of resin) 
in water. Using a microscope, an entire mound of spherical granules could be 
observed in the emulsion. Perrin used a centrifuge to sort the gutta gamba 
granules according to size. Several months of labour yielded 20-30 gm of 
gutta gamba granules having a diameter of 0.74 microns. The density of gutta 
gamba is D=1.195 gm/cm‘, i.e., the mass of one grain was equal to 7X 10~™ gm. 
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Exact determination of the dimensions of the grains was no easy task. Perrin 
made this determination using three independent methods: 

1) The length of a chain of several dozen contiguous grains was determined 
under a microscope. 

2) The weight of several thousand grains was measured and the dimension 
calculated from the known density of gutta gamba. 

3) Stokes’ formula (see p. 218) was used to determine the dimension from ob- 
servations on the velocity with which a cloud of grains sinks in an emulsion. 
It was assumed that, in accordance with Archimedes’ principle, a grain sinks 


under the action of the force Anr(D—dg, where d is the density of the liquid 


and r is the radius of the grain. When the grain sinks uniformly, this force is 
balanced by the force of viscous friction calculated by Stokes’ formula. From 
this condition, it is easy to determine r. 

There was close agreement between the results of all three methods. This 
signified that the effective weight of a microscopic granule floating in a liquid 


may be written in the form mg Co . Recalling that =F we obtain the 
following barometric formula for an “atmosphere” of gutta gamba grains float- 


ing in water: 
—Nmgh ()_ @ 
D ) 


n=ne 


The experiment reduced to the determination of the ratio of concentrations n 
at equal levels. This was accomplished by focussing the microscope on _ suffi- 
ciently thin layers of the emulsion and calculating the number of particles in 
the field of vision for equal intervals of time. Perrin changed the viscosity of 
the emulsion by a factor of one hundred and observed that the ratio of concen- 
trations exactly agreed with the barometric formula. Substituting the values 
of ny, n, h, m, d, D and T, it was possible to determine N. It turned out that, in 
spite of the large changes in the viscosity of the emulsion and the dimensions 
of the grains, N determined in this manner agreed excellently with the values 
predicted by the kinetic molecular theory. Perrin obtained 6X 10%< N<7~x 107%, 
while according to modern data N=6.02472x 10%. This was highly reliable 
evidence that the Boltzmann distribution according to energy is applicable 
alee to particles having a gram molecule (mass of N particles) equal to 50,000 
ons! 


74. Velocity Distribution of Molecules 


The velocity distribution of molecules, first determined theoreti- 
cally by Maxwell, an outstanding English physicist, may be consid- 
ered to be a consequence of Boltzmann’s law. 

According to Boltzmann’s law, the number of molecules whose 
velocities are in the interval from v, to v,+Az,, v, to v,+Ad,, 
and v, to v,+-Av, is 

mo 
An=Ce ?*7 Av, Av,Av,. 


It is implied that we are interested in the velocity distribution in 
a small volume of gas and that the space distribution of the molecules 
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is taken into account by the constant factor C, which does not inter- 
est us at the moment. 

The above formula takes account of the distribution of the mole- 
cules with respect to the magnitudes as well as the directions of the 
velocities. However, we already know the distribution with respect 
to the directions—the number of molecules moving in one or an- 
other direction must be the same for complete randomness in the mo- 
lecular motion. We are interested in the number of molecules hav- 
ing a speed from v to v+Av, where 


— 2 2 2 
v=yf vito +o 


If we construct a three-dimensional diagram, along whose axes 
Uy) Uy, Uz, the projections of the velocities of the molecules, are plot- 
ted, “and consider this space to be divided into infinitely small cubes 
of volume Av,Av,Av,, the data on the velocity distribution of mole- 
cules may be simply represented as the numbers of molecules contained 
in a cube. Boltzmann’s formula gives us the number of mole- 
cules for each one of the cubes. However, examining the formula, we 
see that the number of molecules is the same for all cubes located 
within a spherical shell of radius v to v-++Av, for only the absolute 
value of the velocity enters in the exponential factor of the formula. 
The number of molecules having velocities in the range from w to 
v+Av is proportional to the volume of the spherical shell, i.e., 
4nv*Av. Thus, if the number of molecules contained in one cube is 
equal to 


mu? 


Ce 2*F Av, Av,Av,, 


the number of molecules contained in the spherical shell, i.e., 
possessing velocities in the range from v to Av, is represented by the 
formula 


mo? 


An=Ce ®4T hati: 


What then is the nature of this dependence? At v=0 and v=oo 
the number of molecules is equal to zero. It is evident that the curve 
must have a maximum. Let us determine in the usual manner the 
maximum of the factor preceding Av. Taking the derivative of this 
expression and setting it equal to zero, we obtain 


mu 


Ze BF y%)=0. 


Hence, the value of the velocity for which the distribution function 
has a maximum is 

QeT 

“m. 


C= 
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What can be said about this velocity? Since the number of mole- 
cules having the velocity v are plotted along the ordinate of the 
distribution curve, c is a peculiar boundary. Molecules moving with 
velocities greater or less than c are encountered less often than mole- 
cules of velocity ¢ 

This velocity is called the most probable. The velocity distribu- 
tion curve for gas molecules (Maxwell distribution) is shown in 
Fig. 87 

fi is interesting to compare the formulas for the most probable 
velocity and the r-m-s velocity: 


— ye and Urms = y=. 


We see that the r-m-s velocity is greater than the most probable. 
The reason for this is evident from the form of the distribution curve. 
Since the curve extends far to 
the right, the root-mean- 
square velocity is displaced in 
that direction. 

Let us cite several figures 
characterising the velocity 
distribution of gas molecules. 
The number of molecules with 
velocities close to the most 
probable velocity ¢ is 1.1 
times larger than the number Fig. 87 
of molecules with velocities 
close to the root-mean-square value, 1.9 times larger than the num- 
ber of molecules with velocities close to 0.5c, and 5 times larger 
than the number of molecules with velocities close to 2c (see Fig. 87). 
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75. Measurement of the Velocities of Gas Molecules 


Even though the law of molecular velocity distribution is based 
on exceptionally well-founded theoretical grounds, whose validity 
is confirmed by a large number of physical facts, it is interesting to 
subject the distribution formula to direct experimental verifica- 
tion. 

The velocity of gas molecules can be measured in a volume only 
by indirect means. If a molecule radiates light, the velocity of its 
motion affects the width of the spectral lines (Doppler effect). 

Direct means of measurement require molecular beams. For this 
purpose, a long tube of large diameter is partitioned by two shutters 
having very small apertures. The gas is placed in an end compart- 
ment, whereupon the molecules begin, at first, to penetrate into the 
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middle compartment and will sometimes even reach the compart- 
ment at the other end. Clearly, only those molecules whose vector 
velocities are directed along the axis of the tube when passing through 
the first aperture can traverse the entire length of the tube. Thus, 
a molecular beam is separated out from the gas. The velocities of the 
beam molecules all have the same direction. It is evident, however, 
that due to the random motion of the molecules, the distribution 
with respect to speed will be the same as for the molecules of any 
other direction of motion. 

To measure the velocities of the beam molecules, we can resort to 
an arrangement reminiscent of an apparatus used for measuring the 
velocity of a bullet. Such an apparatus has two cardboard disks 
rigidly fixed on a shaft and rotate about it with a velocity o. If 
the bullet travels parallel to the axis of rotation, the disks will be 
consecutively pierced at two points displaced in azimuth by an angle 
@ with respect to each other. This angle corresponds to the rotation 
of the system while the bullet traversed the distance J between the 


disks. The rotation time for angle @ is equal to 2. Hence, the veloc- 
ity of the bullet is 
9 _ lo 


v=l1 = 


A 7 


Since molecules cannot pierce disks, the analogous experiment 
for molecular beams is performed with disks in which slits are cut 


along radii. The angular distance between the slits is equal to gq. 
Clearly, molecules of velocity v can pass through two slitted rotating 
disks only for a specific angular velocity , satisfying the condition 
ta, Thus, by varying , we can filter the molecules according 
to their velocities, collect molecules having the same velocities and 
measure their relative quantities. 

The velocity distribution formulas discussed above, and hence the. 
formulas for the r-m-s and most probable values of molecular veloc- 
ities, have been verified by numerous experiments. 


76. Probability of a State 


Let us consider a box divided into two equal parts by a partition 
in which an aperture has been cut. If there are molecules of gas in the 
box, they may be transferred from one half of the box to the other 
as the result of random collisions with the walls of the container and 
with each other. 

In spiteof thefact that the molecular motion iscompletely haphaz- 
ard, a method exists for predicting how many molecules, will be 
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in the left half of the box and how many in theright half. This meth- 
od is based on the application of the theory of probability. 

If there were one molecule in the box, the chances or, as we say, 
the probability, that the molecule is in the right-hand portion is 
the same as that it is in the left-hand portion. Since, in all, there are 
two possible cases (the molecule is either in the left-hand or in the 
right-hand portion), and we are interested in the realisation of one 
of these cases, the probability of the molecule being in one half of 


the box is said to be equal to = Now, assume that there are two mol- 


ecules in the box, designated by the figures 1 and 2. In all, there are 
now four possible dispositions—both molecules on the left, both on 
the right, molecule No. 1 on the left and No. 2 on the right and, 
finally, No. 2 on the left and No. 1 on the right. We are interested in 
the probability of finding two molecules on the left. This is one case 


out of four possible ones, so that the probability is equal to a 


i.e, (+). For three molecules, the situation is as follows: 


left 1, 2,3 0 12 1,3 2,3 3 2 1 
right 0O 1,2,3 3 2 1 1,2 1,3 2,3 


Clearly, the probability of all three molecules being on the left is 
- i.e., (5 . It is not difficult to see that, for the case of N mole- 
cules, the probability of all the molecules being in one part of the 
box is equal to (5 wa Whenever another molecule is added, it is 


always possible to place it either in the left-hand or in the right-hand 
part. Therefore, with each newly added molecule, the probability 
of the molecules being in one hali of the container is obtained by di- 
viding the preceding probability by two. 

When the number of molecules is still no more than one hundred, 


the quantity (s)" is already so small that we need no longer take 


account of the possibility that all the molecules will be located in 
one half of the container. However, the number of molecules in a 
cubic centimetre of gas is not one hundred, but about 107°. If the 
container is considered to be divided into two parts, the probability 
that the molecules will all turn up in one half of the vessel is equad to 

> a By taking the logarithm, this number may beconverted into 
the form 107*X1"', To put this number in decimal form, 3x10" 
zeros must be written! A person writing at a rapid speed of three zeros 
per second will require 10'’ sec to write this number. This is equiv- 
alent to 300,000 million years, which is ten times the amount of 
time our solar system has been in existence. 
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Let us return to the table for the disposition of three molecules. 
Only for one disposition out of eight do all the molecules turn up on 
the left. Every other disposition is also encountered one time out of 
eight. It should be remembered, however, that the molecules are ar- 
bitrarily numbered and there is no way of differentiating a disposition 
in which Nos. | and 2 are on the left from one in which 2 and 3, or 1 
and 3, are on the left. Thus, compared to the one disposition in which 
there are three molecules on the left, there are three dispositions in 
which there are two molecules and the same number of dispositions in 
which there is one molecule on the left. Therefore, the probability of 
some characteristic distribution existing, regardless which particular 
molecules produce it, may be measured by the number of disposi- 
tions that can produce the distribution. The greater this number, the 
more frequently will such a distribution be encountered, i.e., the 
more probable will this distribution be. 

The smallest piece of matter consists of a tremendous number of 
molecules. In their haphazard motion, the molecules are endlessly 
changing their location and velocity. However, if the substance is 
in thermal equilibrium with the surroundings, the measurable mac- 
roscopic characteristics of the substance (i.e., the pressure, temper- 
ature or energy, etc.) remain unchanged. One and the same state 
is achieved at every instant in another manner, i.e., the molecules 
are located differently (with respect to the adopted numeration of 
the molecules) and the velocities are different each time (again with 
respect to the numeration). One and the same state is achieved by 
means of a tremendous number of dispositions of the numbered mole- 
cules, but the space and velocity distribution of the number of 
particles remains unchanged. 

The number of dispositions of the molecules leading to one and 
the same space and velocity distribution and, hence, to one and the 
same state of a substance relative to the total number of dispositions 
of the molecules is called the probability of the state. 

If there are only several molecules in a volume under considera- 
tion, the state of this system will continuously change. Not only 
will there be probable states, but sometimes less probable states will 
also be encountered. Thus, in the case of three molecules, all the mol- 
ecules will be in one part of the vessel, on the average, in one case 
out of eight. However, as the number of molecules increases, the 
“weight” of the most probable state also iricreases. The probability 
of achieving the more probable states becomes immeasurably greater 
than the probabilities of other states. To make this clear, let us re- 
turn to the example of the molecules in a box, but assume that the 
box is divided into a thousand million cells. What is the probability 
that N gas molecules are distributed in all the cells except one. Using 
the same reasoning as previously, it is not difficult to conclude that 
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this probability is equal to (0.999999999)". At first glance, it may 
appear that this will yield a value close to unity. However, taking 
N =10°°, the approximate number of gas molecules in one ‘cubic 
centimetre, we find that the probability is equal to 10—-4*!0", 

This calculation shows that the probabilities of the most probable 
states of a substance and the probabilities of improbable states are 
entirely incommensurable quantities. That is why a state of thermal 
equilibrium is macroscopically stable and the parameters of a sub- 
stance in a state of equilibrium do not change with time. 


77. Irreversible Processes from the Molecular Viewpoint 


The example of the previous article shows quite clearly that the 
state in which the molecules are distributed “uniformly” is the most 
probable. Any deviation from “uniformity’—displacement of one 
portion of the molecules to the left side of the container, disposition 
of the faster molecules on the left, directed motion of a large portion 
_ of the molecules, in short, any deviation from haphazardness in the 
space and velocity distribution of the molecules—results in a decrease 
in the probability of the state. This conclusion enables us to compre- 
hend the kinetic-molecular nature of the irreversibility of actual 
processes. 

As was established above, the second law of thermodynamics 
for irreversible processes, i.e., the law of increasing entropy in 
thermally isolated systems, is a generalisation of experimental ex- 
perience on the impossibility of certain processes. Thus, heat cannot 
be transferred from a cold body to a hot one without compensation, 
a body cannot acquire kinetic energy merely at the expense of a 
decrease in the internal energy of the surrounding medium, and a gas 
may expand of itself but not contract. 

The existence of irreversible processes is a peculiarity of molecular 
phenomena. For a purely mechanical phenomenon, i.e., a process 
without friction, the process may always be reversed. When a pen- 
dulum moves to the right, it passes in reverse order through all the 
states passed in moving to the left. A billiard ball rebounding from 
the side of the table in some direction or other will, in turn, rebound 
from an elastic wall placed in its path and retrace in reverse order the 
entire path traversed in the “forward” direction. The complete equiv- 
alence of “forward” and “backward” is evident for purely mechan-* 
ical processes. Why then do molecular processes, which we -have 
- considered as the totality of the movements of the molecules, not 
possess the property of reversibility. There is only one reason. In 
all irreversible processes, the probability of the state increases. 
A reversible process is a conceivable process, i.e., its implementa- 
tion is in principle possible, but for the time available to a person 
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for observation such a processis, practically speaking, improb- 
able. 

This is not difficult to show for any irreversible process. Heat 
transfers from a hot body to a cold one, but not vice versa. In the 
case of gaseous bodies, such a process may be visualised as a mixing 
of fast molecules with slow ones. The reverse process cannot occur 
according to the random law, for it would constitute a sorting out 
of fast and slow molecules, i.e., a transition to a more orderly 
state. 

For the same reason, using a shovel, we can quite rapidly mix the 
contents of two sacks of different grain. However, we can continue 
to mix the contents of these two sacks endlessly without the grain 
separating in such a manner that one of the grain varieties appears 
above and the other below. It should be realised, moreover, that the 
number of kernels in the sacks is immeasurably less than the number 
of molecules in a cubic millimetre. 

It is also easily seen that the reverse process of spontaneous ex- 
pansion of a gas is completely improbable. If in the partitioned box 
considered above there is gas on the left and vacuum on the right, 
both parts of the box will be uniformly filled with gas within a short 
time. In principle, it could occur that all the molecules turn up 
together again on the left. However, the probability of such an event 


is extremely small. Its value has been shown to be equal to (3)* 


No matter which irreversible process we consider, the result will 
always be the same—each irreversible process is accompanied by an 
increase in the probability of the state. 

Thus, there are two quantities that increase during irreversible 
processes—the entropy S, with which we are already familiar, and 
the probability of the state W, which we have just discussed. It 
is natural that these two physical quantities should be related. Boltz- 
mann showed that such a relation does, in fact, exist. The formula 
given by him has the form S =k In W, i.e., the entropy is proportion- 
al to the logarithm of the probability of the state. 

The second law of thermodynamics thus acquires still another 
formulation: In reversible processes, the probability of the state does 
not change, while in irreversible processes (we are referring to sclosee 
systems) the probability of the state increases. 


78. Fluctuations. Limits to the Application of the Second Law: 


All physical properties remain unchanged if the space and velocity 
distributions of the molecules do not change. In principle, the dis 
tribution of the molecules of a substance may change with time. How- 
ever, as we have indicated above, the most probable distributions 
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stand out so sharply that deviations from these distributions must 
be considered to be very rare events. The physical characteristics 
corresponding to this most probable distribution are called the 
average characteristics. Practically, the deviation of a measured 
physical characteristic from its average value for systems having 
large numbers of molecules is impossible to observe. This is the sit- 
uation when the physical properties are being considered for volumes 
containing large numbers of molecules. However, if the number of 
particles in a system is small, it is also possible to observe rarer 
space and velocity distributions of molecules. The values of the phys- 
ical characteristics corresponding to these rarer distributions differ 
from the average values. These deviations of the physical character- 
istics from their average values, which occur in systems having a 
relatively small number of particles, are called fluctuations. All 
properties of volumes containing small numbers of molecules—e.g., 
temperature and pressure, thermal capacity and thermal conduc- 
tivity—are subject to fluctuations about the average values. 

This question may be approached somewhat differently. If a tiny 
mirror suspended from a thin string is placed in a gaseous medium, 
then, from the macroscopic standpoint, the pressure of the gas acting 
on the mirror cannot manifest itself, for the forces act on all sides 
equally. In principle, from the molecular standpoint, the changes 
in momentum due to the impacts of the molecules on the mirror do 
not necessarily have to balance for the different portions of its sur- 
face. A light mirror may thus begin to execute fluctuational vibra- 
tions. As was stated above, for any particle (molecule, Brownian 
particle, pea), the energy of thermal, random motion is equal to 


+ kT for one degree of freedom of motion. And this is the energy, on 


the average, falling on the mirror. On the other hand, the work of 
rotating the string by an angle Ag is equal to MAg. Therefore, ang- 


ular deflections of theorder of magnitude Agxit will occur quite 


often. 

Such fluctuations are indeed observed and their measurement may 
be used for the experimental determination of Boltzmann’s constant 
and, hence, of Avogadro’s number. 

Fluctuational effects limit the accuracy of measurements. A point- 
er, mirror, or any other indicating device is subject to fluctuations. 
At room temperature, the accuracy limit in energy units is about 
10-*° joule. In the construction of many instruments such accuracy 
has not yet been achieved, but in some of the best measuring devices 
it already has. ; 

Fluctuations limit the applicability of the second law of thermo- 
dynamics, They represent processes in which the system passes from 
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a more probable state to a less probable one, i.e., processes in which 
the entropy decreases. 

This is excellently illustrated by Brownian movement, where the 
pressure fluctuations occur in small volumes and affect individual 
particles. Due to these random pressure vibrations, a particle may, 
for example, be impelled upwards. However, motion against the 
force of gravity requires work. In this case, the work is performed at 
the expense of the random, thermal motion of the molecules, i.e., 
only at the expense of the internal energy of the substance, which 
is at complete variance with the second law of thermodynamics. 

Although phenomena for which the entropy decreases occur at 
times in individual small volumes, i.e., the second law of thermody- 
namics is contradicted, the system as a whole will always obey this 
law. Due to the randomness of the events, the number of processes 
occurring at the expense of the internal energy will be exactly equal 
to the number of processes occurring in the reverse direction. It can 
be rigorously proved that any attempt to create a perpetual engine 
of the second kind by “selecting” in individual small volumes processes 
that contradict the second law will end in failure. 

The second law of thermodynamics has a limit at the other end of 
the scale as well. In addition to being inapplicable to systems having 
a very small number of particles, it loses validity for systems having 
an infinitely large number of particles, namely, the universe or any of 
its infinitely large components. As was explained above, the essence 
of the second law of thermodynamics consists in the fact that the num- 
ber of equilibrium states is overwhelmingly greater than the number 
of nonequilibrium distributions. However, for the universe, which 
consists of an infinitely large number of particles, this statement 
loses its meaning, for the number of equilibrium states and thenum- 
ber of nonequilibrium states are both infinitely large. Consequently, 
for the universe as a whole, one cannot speak of the differences in 
the probability of states. 

This is confirmed by general philosophical and physical consider- 
ations first advanced by Engels and Boltzmann, respectively. As 
a matter of fact, if we assume that the second law of thermodynam- 
ics is applicable to the entire universe, then it naturally follows 
that the universe is approaching a “thermal death”. It implies that 
sooner or later “the most probable” state of thermal equilibrium will 
be established in the universe, whereupon further processes will 
cease. This notion, first advanced by Clausius, is at variance with 
the materialist view that the universe is eternal in time. 


CHAPTER XIII 
PROCESSES OF TRANSITION TO EQUILIBRIUM 


79. Diffusion 


A body interacting with its medium changes its stateso as to come 
into equilibrium with the bodies surrounding it: its internal 
energy tends to a minimum, while its entropy increases and becomes 
a maximum when equilibrium is established. These two tendencies 
are usually conflicting. As a result it is difficult to predict the 
effect when both energy and entropy are capable of changing. Let 
us now consider the phenomena of diffusion, thermal conductivity 
and internal friction occurring in closed systems. In other words, 
we are concerned with equalisation of the concentrations, tempera- 
tures and velocities of some parts of a body with respect to others. 
(Naturally, equalisation of velocities is meaningful only for liquids 
and gases.) Since the energy cannot change in such systems, the tran- 
sition to a state of equilibrium consists only in an increase in entropy. 

The basic laws for the phenomena of difiusion, thermal conductiv- 
ity and internal friction are very similar. Let us first consider diflu- 
sion processes. It is immaterial whether we deal with the equalisa- 
tion of the concentration of a gas or a liquid. Our discussion will 
even be valid for solid solutions (see p.614), since in this case too the 
tendency to maximum entropy makes the atoms or molecules of 
a substance intermix so that a single concentration is established 
in all parts of the body. 

Let us consider two physically close, infinitely small volumes of 
a substance whose concentrations of diffusing atoms (or molecules) 
are c and c+dce. If these two volumes are a distance dx apart, the ratio 
Me gives the rate of change of concentration. This ratio is called the 
concentration gradient. If the x-axis is chosen so that its positive 
direction coincides with the diffusion direction, then ae will be a 
negative quantity. Substance will migrate in the direction of lower 
concentrations. 

This does not mean that all the molecules move in one direction 
in a continuous, uninterrupted stream. On the contrary, diffusion 
maintains to a considerable extent the haphazard features peculiar 
to molecular motion. The molecules move haphazardly in all direc- 
tions, including the direction of greater concentration, but the prob- 
ability of molecules being displaced in the “correct” direction is 
greater than for other directions. This means that through an area 
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perpendicular to the flow more particles pass in the direction from 
greater concentration to less concentration than the other way 
round. 

The basic law of diffusion states that the flow of matter p, i.e., 
the mass of matter passing through a unit area per unit time, is di- 
rectly proportional to the negative gradient of the concentration: 


D, the constant of proportionality, is called the coefficient of diffu- 
sion. The above relation is convenient since the coefficient of diffusion 
is, within broad limits, a constant for a given substance and medium. 
In measuring the concentration and the flow of matter, the units 
should correspond. Thus, if the concentration is measured in grams 
per cm’, the flow should be measured in grams per cm? per sec. We 
see then that the dimensions of the coefficient of diffusion are com- 
pletely determined and in the 
d CGS system are expressed in 
tanot= = <O cm?/sec. 
x : : 
A decrease in concentration 
usually follows a sagging curve 
as shown in Fig. 88. If we are 
interested in the portion for 
which the decrease in  con- 
centration may be represented 
by a straight line, then 
oes Co— Cy 
Lt=— D Xe—%4 ’ 
where c, and c, are the values 
Fig. 88 of the concentrations at points 
Xx, and x2, respectively. 





The coefficients of diffusion vary within broad limits. For example: 
1) for gases at temperatures from 0° td 15°C: 


hydrogen —+ oxygen, D=0.778 cm?/sec; 
air —, oxygen, D=0.178 cm?/sec; 
air—,» carbon disulphide, D=0.099 cm?/sec; 


2) for solutions of blue vitriol diffusing in distilled water (where c is in 
gram equivalents per litre): 


ce D (cm? ‘day) 
0.1 0.39 
0.5 0.29 
0.95 0.23 
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80. Thermal Conductivity and Viscosity 


The temperature equalisation process is very similar to that of 
diffusion. If the temperature of a body differs at different points, the 
entropy is not amaximum. In order for equilibrium to be established, 
the average velocities of the molecules, and hence the tempera- 
tures, must become equalised. 

If the temperatures at two neighbouring points separated by a dis- 


tance dx are T and T-+dT, the ratio an expresses the rate of tem- 


perature drop and is called the feather deur gradient. 

During the process of temperature equalisation, portions of 
the body having more energy give up energy to portions of the 
body having less energy. In acertain sense, heat “flows” from one 
portion to another. The amount of heat passing from one portion of 
the body to another through a unit area per unit time is called the 
heat flow q. Just as in the case of diffusion, one can assume that the 
heat flow is proportional to the negative temperature gradient. 
The greater the temperature difference, the more rapid the heat 
flow. The formula 


is also convenient here too since the constant of proportionality x, 
which is called the coefficient of thermal conductivity, is a constant 
for a given substance and does not depend on the magnitude of the heat 
flow. For a linear temperature drop, the formula assumes the sim- 
plified form 
ods T,—T, 
ames ener 
It is not difficult to determine the dimensions of the coefficient 
of thermal conductivity. In the CGS system, this coefficient is meas- 
ured in cal/em sec deg. It is evident from the formula that x is 
the heat flow per second through an area of 1 cm? when the temper- 
ature drops 1° over a distance of 1 cm. 


The values of the coefficient of thermal conductivity, just as in the case of 
the coefficient of diffusion, vary within broad limits. 

For example: 

1) for solid bodies (0°-18°C): cork—0.00012, wood—0.0008, fused quartz— 
0.0033 and silver—1.06 cal/em sec deg 

2) for liquids: carbon bisulphide (14°C) —2% 107, sulphuric acid 30% 
(82°C)—62.4X 1074, mercury (0°C)—0.2 cal/em sec deg. 

3) for gases (O°C): carbon dioxide—3.4x 107§, pe 7X 1075, hydrogen— 
40.6X 107° cal/em sec deg. 


8* 
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The third phenomenon of this class relates to the equalisation of 
velocities. If a gas or liquid moves in some direction or other so 
that different layers of the substance move at different velocities, 
the motion is unstable. Sooner or later, the velocities must become 
equalised—the slower layers are accelerated and the faster ones are 
retarded. This phenomenon is also called internal friction or viscosity. 

Let us consider a liquid or gas moving along an x-axis. Assume 
that different layers of the fluid are moving at different velocities. 
Along the y-axis, perpendicular to the direction of flow of the liquid 
or gas, take two close points separated by the distance dy. The ve- 


locities of flow differ at these two points by du. Thus, the ratio a is 


the gradient of the velocity and expresses the rate of change of the 
velocity as we move away from the surface of the fluid. To make 
this clear, consider a rapid stream. The velocity of flow is a maxi- 
mum at the surface and gradually decreases as the bottom of the 
stream is approached. 

If at some instant we eliminate the causes of the fluid motion, the 
velocities of the different layers will begin to be equalised in accordance 
with the law of increasing entropy. In order for such an equalisation 
to be possible, an internal frictional force must exist between the layers 
of the liquid or gas. The magnitude of this force per unit layer 
area is proportional to the gradient of the velocity, i-e., 


du 
fag 


Here, n is the coefficient of viscosity (or internal friction). Its dimen- 
sions in the CGS system are gm/cm sec. Such a unit is called a poise, 

The viscosity of different bodies varies within even broader limits than the 
two analogous coefficients considered above. For example: 

1) Solid bodies: glass (710°C)—4.5X 10", glass (420°C) —4 X 1018, lead (9°C) — 
4.7X 10%, ice (—14°C)—8.5xX 101? poises 

2) Liquids: oy ether (25°C) —0.0022, ‘water (20°C) —0.01, glycerine (0.8% 

water, 18°C)—13.93  poises. 

3) Gases: hydrogen (0°C)—8.49X 1075, air (0°C)—17.19X 1075 poise. 

It is interesting to note that hydrogen "has one-half the viscosity of air and 
seven times its thermal conductivity. That is why hydrogen is used to cool 
powerful turbogenerators. 


81. Rate of Equalisation 


We all know that the time it takes for equilibrium to be estab- 
lished varies within broad limits. The temperature of ared-hot piece 
of iron thrown into water and the temperature of the water become 
equalised rapidly. On the other hand, the temperatures of air and a 
hot brick become equalised slowly. Nitrogen diffuses almost instan- 
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taneously in oxygen, while the equalisation of the concentrations 
of a solution of blue vitriol takes many days. Similarly, the 
equalisation of velocities also varies within broad limits, depending 
on whether we are dealing with a gas or a viscous liquid. 

A universal formula for the time of equalisation cannot be given, 
for the geometry of the object affects the time. A cooling body may 
have the form of a cylinder or a plate; a diffusing gas may initially 
be within a small spherical volume or distributed over some surface; 
and the internal friction may occur in pipes of various cross-section 
or in open reservoirs. The particular circumstances must be taken 
into account in each case and exact calculation of the value for the 
time of equalisation is a difficult mathematical problem. However, 
we can abstract from the geometric details and try to solve the 
problem in general form if we abandon the aim of obtaining an 
exact formula and are content to merely determine the proportion- 
ality between the physical quantities. In this connection, it is 
helpful to consider the dimensions of the physical quantities that 
should be related to each other. 

Let us consider, for example, the phenomenon of diffusion. It 
is evident that ¢, the time of concentration equalisation, depends, 
in the first place, on the dimensions of the region in which the diffu- 
sion occurs (characteristic length L) and the properties of the diffus- 
ing substance (characterised by the coefficient of diffusion D). The 


diffusion equation has the form p= ae The dimensional equa- 


tion is then 
M M 
Cr [D] “Li * 


We see that T = a ,i.e., the time of equalisation t= K+ and does 


not depend on the concentration. ; 

Therefore, the following conclusion may be drawn: Every rigorous 
solution determining the time of concentration equalisation for 
diffusion processes will always yield the equation 


2 
i=K—, 


where K is a constant, dimensionless quantity depending on the 
geometric conditions of the problem. The quantity L, on whose 
square the speed of concentration equalisation depends, refers to the 
geometric dimension of the region in which the equalisation occurs. 
Thus, if the concentration becomes equalised within the limits of 
one centimetre in, say, 10 sec, then, within the limits of two 
centimetres, it will become equalised in 40 sec. 
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We can solve the problem of temperature equalisation in exactly 
the same manner. The basic relation of this phenomenon includes the 
following quantities: heat, coefficient of thermal conductivity, tem- 
perature and distance. However, the increment of heat per unit 
volume may be written in the form 


dq= ec,d T, 


where c, is the specific heat at constant pressure and g is the density 
(thus, c,@ is the thermal capacity of a unit volume). Therefore, the 
following quantities must be related to each other: temperature, 
length, time, density, thermal capacity and thermal conductivity. 
It is easily verified that the time ¢ cannot depend on the tempera- 
ture and can only be expressed by the other quantities as follows: 
L'QCp 
“* 
Thus, the time of temperature equalisation is expressed by the 
formula 


where y designates the combination of constants ~, The quantity 


y is called the thermometric conductivity and has been introduced in 
order to put the formulas for the equalisation of concentration and 
temperature in similar form. The coefficients of diffusion and ther- 
mometric conductivity have the same dimensions and are completely 
analogous in the two equalisation phenomena considered. 

We thus see how the cooling of a body is determined. The greater 
the density and thermal capacity, and the smaller the coefficient of 
thermal conductivity, the slower the process. 

Example. Let us compare two rods of identical dimensions made of fused 
quartz and silver, respectively. For quartz, x=0.0033 cal/cm sec deg, 9e=2.65 
gm/cm’, and Cp=0.1844 cal/gm deg, i.e., y=0.676X 107? cm?/sec. For silver, 
x= 1.06 cal fem sec deg, 9= 10.5 gm/cm* and c,=0.0558 cal /gm deg, i.e., y=1.71 
cem?/sec. Thus, the equalisation of temperature in the quartz rod takes 253 times 
longer than in the silver rod. 

Just as for diffusion, equalisation of the temperatures is character- 
ised by a dependence on the square of distance, i.e., the time of 
equalisation is proportional to the square of the linear dimension 
of the region. 

Without going through an analogous procedure, let us write the 
formula for the time of velocity equalisation for the various parts of 
a liquid or gas. It is not surprising that the form of this relation is 
similar, namely: 


2 
t=K->. 
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The coefficient v, which determines the rate of equalisation of the 
velocities, is equal to “ and is called the kinematic viscosity. 


Example. For water, n=0.01 poise and 9=1 gm/cm’, i.e., v=0.01 cm?/sec; 
for glycerine, n=13.9 poises and @=1.25 gm/cm’, i.e., v=11.1 cm/sec. This 
means that if a disturbance in glycerine is equalised in0.1 sec, the same disturb- 
ance in water will be equalised in about 2 minutes. 


82. Steady Processes 


If a body is left undisturbed, the differences in temperature con- 
centration and velocity of the various parts of the body will be equal- 
ised, to be sure, in accordance with the principle of increasing en- 
tropy. However, it is also possible to have a state of a body for which, 
over a prolonged period, the flow of heat or matter, or the velocity 
distribution of various parts of the body with respect to each other, 
remains unchanged. Processes of this type are called steady processes. 
Naturally, in the case of a steady process, the body is not in a state 
of equilibrium. : 

Under what conditions are such processes possible? Let us consider 
a metallic rod to which at each instant of time, a certain quantity 
of heat is supplied at one end of the rod, while the other end is in ther- 
mal contact with acold body. The condition under which the tempera- 
tures along the rod will not change, i.e., the condition of constant 
temperature gradient along the entire path of heat flow, is that the 
quantity of heat absorbed by the cold body be exactly equal to the 
aan of heat supplied by the hot body during the same period 
of time. 

Under analogous conditions, a steady diffusion process is also 
possible. To create such a process, a certain quantity of matter must 
be supplied to one part of a body and the same quantity must be 
removed from another part. In this manner, a constant difference of 
concentration is maintained between two parts of the body. 

A steady viscosity process may be obtained, for example, in the 
region between two coaxial cylinders rotating at different velocities. 
Since close to the solid surface the liquid or gas has the same velocity 
as the solid wall; a constant velocity gradient is created within the 
fluid. 

Steady processes do not arise immediately. A certain amound of 
time must elapse for such processes to become established. 

Let us assume that one end of a rod that transmits heat is placed 
in snow. At the initial instant, the temperature of the rod is equal 
to zero at all points. If the other end of the rod is then brought into 
thermal contact with boiling water, the temperature begins to rise 
throughout the rod, but, of course, not at the same rate at all points. 
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Almost immediately, a high temperature is established at the end 
of the rod that is in contact with the boiling water. The temperature 
of the end of the rod that has been placed in snow will rise slowest. 
After a certain period of time, the temperature will cease rising at 
all points of the rod and a definite temperature distribution becomes 
established, i.e., the process becomes steady. The nature of the tem- 
perature distribution depends on the amount of heat supplied (or 
removed) per unit time. 

In an electric iron heated by a spiral element, the highest temper- 
ature is in the central region and the temperature gradually drops 
towards the outer edges. Naturally, the air immediately sur- 
rounding the iron is hottest. With increasing distance from the iron, 
the temperature falls more rapidly owing to the low thermal conduc- 
tivity of air. 

In the case of small bodies in air or liquid, the temperature curve 
need not be considered in rough calculations, i.e., it suffices to deal 
with the temperature difference T—T, between the body and the 
medium. The heat flow per unit time from the body to the medium may 
then be assumed to be proportional to this temperature difference: 


q=k(T—T,). 


The coefficient & is called the coefficient of thermal output and is an 
important engineering quantity. In courses on heat engineering, 
values for this coefficient are determined and related calculations 
discussed. 

Let us designate by P the power supplied to a body, e.g., electric 
power in the case of an electric iron. The condition for a steady proc- 
ess requires that 


P=k(T—T,). 


Here, T is the body temperature established in this steady process: 
i Tan. It may vary considerably, depending on the power sup- 


plied and the conditions of heat exchange. 

It is appropriate at this point to comment on the temperature 
indicated by a thermometer placed “in the sun”. The thermometer 
is involved in the steady process of transferring solar heat to the sur- 
rounding air. Depending on the value of the coefficient of thermal 
output, the thermometer lying in the direct rays of the sun may 
indicate any value whatsoever. The temperature measured under 
such conditions is the temperature of the thermometer and is in no 
way an indicator of the weather. 

We shall not consider the analogous diffusion problems. 
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8&3. Motion in a Viscous Medium 


Consideration of the dimensions of physical quantities helps to 
solve problems of tremendous practical importance. One such 
problem is the steady flow of a liquid or gas around an obstacle or, 
what amounts to the same thing, the steady motion of a body ina 
medium. 

The most important problem concerns the resistance force 
experienced by a body in moving through a medium. This resistance 
force may depend on the body dimension L, the body velocity uw, 
and properties of the liquid (or gas), namely, its density @ and 
viscosity y. Other quantities should play no part in this process. 

Let us first consider the dimensionless quantity comprised of L, u, 


g and 4. It will be recalled that the kinematic viscosity, v=, has 


the dimension L?T~*. But the product Lu also has this dimension. 
Therefore, 
olu Lu 

is a dimensionless quantity. This quantity is designated as indicated 
and is called the Reynolds number. It can be shown that Re is, in 
effect, the only dimensionless combination of the indicated quanti- 
ties. Other dimensionless quantities can only be functions of the 
Reynolds number, i.e., f(Re). If the motions of different bodies in 
different fluids lead to one and the same value for Re, the motions 
are said to be similar. A large technical field founded on the prin- 
ciple of similitude has developed. In this field, the characteristics 
of a phenomenon are determined on the basis of observations of a 
similar phenomenon occurring in a model. 

Let us now return to the problem raised above, namely, finding 
the expression for the resistance force experienced by a body moving 
in a medium. 

The dimensions of force are given by MLT~’. This can be equated 
to the dimensions of the quantities with which we are dealing, since 
it cannot depend on any other quantities. Thus, 


MLT~*? =[o]* [us (L] [a], 
i.e., Say a8 
MLT7? = ML-" LST OL LT, 
Hence, 
atd=1, —3a+Bp+y—s=1, —p—b=—2. 


Expressing a, B and y in terms of 6, we obtain 
a=1—8 Bp=2—6, y=2—6. 
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Thus, in the most general case, F may be expressed in the form 
of a sum of terms, each of which has the indicated dimensions, i.e., 
a 1-3 ,2-57 2-837 4, 27 2 Lug ae 
F=AlQ?u?*L n]= eu L*A | ( <2) ie 


where A represents numerical coefficients. We have thereby demon- 
strated that the force must be given by the formula 


F= Kou’L’f (Re). 


This result has been obtained only by considering the dimensions! 
The function f(Re) is not known and must be determined experi- 
mentally. 

Definitive formulas for limiting cases may be obtained by simple 
reasoning. If the velocity is small, F must be proportional to the 
first power of the velocity u. For this to be true, f(Re) must be equal 


to aes and, therefore, 
Re 
F= Kyl. 


The numerical value of the constant depends on the form of the 
body. For a sphere, 
F=6nnur, 


where r is the radius of the sphere. The last formula is known as 
Stokes’ formula. 


Example. A mercury globule (r=0.53 mm), sinking in glycerine with a ve- 
locity of 0.6 cm/sec, experiences a force of friction of about 8 dynes, 


In the case of very large velocities, the fluid motion with respect 
to a body ceases to be steady. Eddies appear and the motion becomes 
turbulent. The body motion may be steady, but the fluid particles 
move more or less randomly. Owing to the intense nature of the 
disturbance, the transfer of motion from layer to layer ceases to 
depend on the viscosity. This can only occur if f(Re) approaches a 
limit as the velocity increases. Therefore, for large velocities, the 
resistance force becomes proportional to the velocity squared: 


F=Kon'l’. 


84. Coefficients of Diffusion, Viscosity and Thermal Conductivity 
for Gases 


The processes of equilibrium establishment in gases are closely 
related to the characteristics discussed in the previous article. Tem- 
perature, concentration and velocity equalisation of some parts of 
a gas with respect to others occur owing to the mixing of the molecules. 
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The rate of this mixing is determined by the role played by colli- 
sions between particles. For example, in case of a large free path, the 
fast molecules quickly penetrate into the regions where the slow 
molecules are located and distribute themselves throughout the gas. 

It is quite natural that the time of equalisation in all three proces- 
ses should be of the same order of magnitude as the time between 
molecular collisions. This may be verified by theoretical calculations 
for particular cases, but we shall not concern ourselves with this 
problem. 


Taking the equation for the equalisation time to be t=, where- 


by a dimensionless constant of proportionality whose order of 
magnitude is usually equal to unity is omitted, we obtain on the 
basis of Sec. 81 perfectly identical expressions for the coefficients 
of diffusion,* kinematic viscosity and thermometric conductivity 
(assuming L=1l): D~v~yr~dl. 

The following table indicates the accuracy obtained: 


air hydrogen 

v=0.13 v= 0.94 
x¥=0.18 XY=1.3 
vl = 0.27 vl =1.9 


These results should be considered good. Agreement within an 
order of magnitude cannot be viewed as accidental when it is recalled 
how greatly the quantities with which we are dealing vary. 

Using the expression for the coefficient of thermal conductivity 
in terms of the thermometric conductivity, we obtain 


MCV 
x~Qulc,~—— ; 


where m is the mass of a molecule. 

In this formula, n, the number of molecules in a unit volume, has 
cancelled out. It follows, therefore, that the thermal conductivity of 
a gas does not depend on its density and, hence, pressure. We should 
carefully note this unexpected, but nevertheless perfectly correct, 
conclusion. Increasing the density of gas does not lead to an increase 
in thermal conductivity. 


® 

* It should be kept in mind that, in addition to the concept of diffusion 
of one substance in another, there is the concept of self-diffusion, i-e., the mo- 
tion of molecules among similar molecules, e.g., the diffusion of hydrogen in 
hydrogen, oxygen in oxygen, etc. Investigation of this phenomenon became pos- 
sible after the technique of radioactive tracers (atoms and, hence, molecules) 
was introduced. 

Thus, D is here the coefficient of self-diffusion. 
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Another prediction may be made on the basis of the formula for 
the coefficient of thermal conductivity. Since the effective cross- 
section and the thermal capacity hardly depend on the temperature 
(generally speaking, o decreases slightly with increasing tempera- 
ture), and the thermal velocity is proportional to V T, the coefficient 
of thermal conductivity should be proportional to the square root 
of the temperature. 

The data presented below indicate the accuracy of these two pre- 
dictions. For example, for nitrogen at 0° C, 325°C and 500°C: 


%,:M,2%, = 1.93:1.65:1 
VTy:VT,:VT, =1.68:1.48:1. 


We see that the thermal conductivity increases with temperature 


somewhat more than proportionally to V 7. Thisis due to changes 
in the cross-section and thermal capacity. As can be seen, the thermal — 
conductivity is independent of pressure in a very broad interval. 

Similarly, the dynamic viscosity n~gvl also does not depend on 
the pressure and density of the gas. The temperature dependence of 
the viscosity of an ideal gas should be the same as that of the thermal 
conductivity, i.e., the same proportionality should exist. A numeri- 
cal example will help to fix this point. 

For nitrogen (T, == 273°, T,=289°, T,=296°): 


N3:N.: 1, = 1.06:1.04:1 
VT :VT,:VT, = 1.04:1.03:1, 


The viscosity of a gas remains amazingly constant with changing 
pressure. When the pressure of CO, changes by a factor of 380—from 
2mm to 760 mm of mercury—the viscosity practically does not 
change. It remains at all times equal to 14.8 107° poises—to within 
an accuracy of one unit in the third figure. 


85. Ultra-Rarefied Gases 


When the free path length in a gas is greater than the linear di- 
mensions of the vessel, we say the gas is ultra-rarefied. Under normal 
conditions, the magnitude of the free path length is of the order of 
10-° cm and is inversely proportional to the density. Therefore, at. 
a pressure of the order of 10°* mm of mercury, the free path length 
is measured in tens of centimetres. For vessel dimensions of about 
10 cm, a vacuum or ultra-rarefied gas is obtained at such a pressure. 

It should be noted that even in a vacuum the number of molecules 
per unit volume is large. For the pressure indicated above, 1 cm’ 
of gas contains tens of thousands of millions of molecules. 
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When molecules cease to collide with one another, and collide 
only with the walls of the vessel, the gas acquires certain special charac- 
teristics. A number of concepts become meaningless under such con- 
ditions. Thus, it is no longer possible to speak of the internal friction 
of the gas molecules, since there can be no molecular layers exchang- 
ing velocities in the gas. It is no longer possible to speak of the pres- 
sure of one part of the gas with respect to another (however, the con- 
cept of gas pressure against the walls of the vessel retains its meaning). 
Also, the concept of heat exchange between different parts of the gas 
and, in general, all concepts related to interaction between different 
parts of the gas become meaningless. An ultra-rarefied gas interacts 
only with bodies within the gas. 

It will be useful to illustrate by means of examples the specific 
character of vacuum as a special physical state of a gas. 

What is the expression for the heat flow from one plate to another 
when these plates have different temperatures T, and T, and are lo- 
cated in a vacuum? The essence of heat exchange in this case consists 
in the following: gas molecules strike a wall and rebound from it with 
an average velocity corresponding to the temperature of this wall. 
As regards the expression for the heat flow, examining the familiar 


formula 
T,—T, 

ee : 
we see that the change consists in the fact that the role of free path 
length is now played by L, the distance between the walls. Therefore, 
the expression for heat flow should assume the following form in 
the case of ultra-rarefied gases: 

q= ecu (T,—T,). 


According to this formula, when ultra-rarefied gases are rarefied 
still further, the heat flow should decrease after the free path length 
becomes comparable to the linear dimensions of the vessel. And this 
is precisely what experiment shows to be the case. 

Also peculiar to an ultra-rarefied gas are the equilibrium condi- 
tions for a gas in two communicating vessels at different tempera- 
tures. In the case of ausual gas, the gas pressures in both vessels are 
the same at different temperatures, but the gas densities are differ- 
ent, i.e., they are inversely proportional to the temperatures. 
Equality of pressures is necessary for equilibrium, for otherwise gas 
molecules will be knocked from one vessel into the other as a result 
of molecular collisions. ; 

In the case of a vacuum, the situation is completely different. In 
this case, no collisions occur between molecules and as a result the 
molecular flow between vessels is not impeded. The equilibrium 
condition consists in equality of molecular fluxes. If there are n 


T,—T, 
ecul pie 7 
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particles in a unit volume and the particles move with a velocity v, 
then nv molecules pass through a unit area per unit time. Thus, for 
equilibrium, n,0,—=n,v,. Since the number of molecules in a unit 
volume is proportional to the pressure divided by the temperature 
(this follows from the equation of state for an ideal gas) and since 
the molecular velocity is proportional to the square root of the tem- 
perature, the condition for equilibrium assumes the form 


Pr __ ie 
VT VT," 
Thus, it is not the pressures that are equal, but rather the ratios of 
the pressures to the square root of the temperatures. If the gas den- 
sity is increased, the pressures gradually begin to be equalised and 
the usual equilibrium condition is obtained when the free path 
length becomes sufficiently small. 
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CHAPTER XIV 


ELECTRIC FIELDS 


86. Vector Properties of Electric Fields: Intensity and Displacement 


The presence of an electric field in a region may be recognised by 
a variety of properties. Thus, an electric field creates a force that acts 
on electric charges. Also, it can induce electric charges on the surface 
of a neutral metallic body. 

By measuring the force acting on a charge qg, one can show that 
the force F has different magnitudes and directions at different points 
in space, and at a given point is proportional to g. Hence, it is pos- 
sible to describe an electric field by its intensity E, which is defined 
as follows: 

E=—., 
q 
The stipulation should be made that g must be small. Then, E may 
be measured at points in space that are sufficiently close to each 
other and the field created by charge q does not noticeably distort 
the measured field. 

A vector field is frequently characterised by so-called vector flow 
lines. The tangent at each point of such a line coincides with the di- 
rection of the vector at this point. This also holds for electric fields, 
which may be characterised by vector flow lines of electric intensity E. 

Numerical examples. 1. The ee field intensity of an incandescent wire 
is tens of volts per centimetr 

2. The electric field jitensily. of the Earth close to its surface is ~ 100 volts/ 
metre== 1 Statvolts (Gaussian units). 

3. The electric field intensity of a hydrogen atom’s nucleus at a distance 
corresponding to the radius of the electron’s “orbit” is 19.2X 10° statvolts/em= 
=57.6X 10% volts/metre. 


4. The electric field intensity at which air breakdown occurs is 30 kv/on= 
=100 statvolts/cm. 





An experiment for the determination of the charge induced by a 
field may be conducted using two small metallic plates fastened to 
an insulated handle as shown in Fig. 89. Such plates are called Mie 
plates—after the German physicist G. Mie. Placing the pair of closely 
spaced plates in a field, and then carefully rotating them, positive 
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charge may be accumulated on one plate and negative charge on the 
other. Moreover, the quantity of induced electricity may be measured 
by an electrometer or ballistic galvanometer. 

Experiments show that the plates may always be so oriented that 
no electricity is induced on the plate faces. For homogeneous, iso- 
tropic bodies (and for the present we shall not consider others), this 
occurs when the plate faces are parallel to the vector E. On the other 
hand, the induced electricity is a maximum when the plate faces 





Fig. 89 


are perpendicular to the vector E. This enables us to introduce still 
another vector to describe an electric field, namely, the electric dis- 
placement , which is defined by the following condition: The vector 
® is normal to the Mie plates when the orientation of the plates is 
optimal with respect to induced charge, i.e., when a maximum 
charge is induced on them. Moreover, this vector is directed outward 
from the positive Mie plate. In all cases, except for anisotropic bodies, 
® and E have the same direction. The absolute value of D is equal 
to a: 
[D| =9, 


where o is the surface charge density of the Mie plate. Since the sur- 
face charge density o may be written as a , then 
L 


dq 
|>|=3. 
=a 


It was stated above that the electric field may be characterised 
by the flow lines of vector E. We can, of course, also describe the 
field by the flow lines of vector 9, i.e., the electric lines of force. 
The number of lines of force passing through a unit area perpendicular 
to the force lines is |D\j=D, and the quantity 


dN =DdS , 
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is called the electric flux through the area dS,. If the same electric 
flux passes through the inclined area dS as through dS, then 


Asien 954, 


cosa ’ 


where a is the angle between the normal to the area and the lines of 
force, i.e., 
dN =D cos adS. 


The flux through a large surface is expressed in the form 
= y D cos adS, 


and the flux through a closed surface is usually denoted by placing 
a circle on the integral sign: 


N=$D cosadS. 


87. Permittivity 


Experiments show that the two vectors characterising an electric 
field are related. For the case when these vectors are parallel, they 
are also proportional to each other at a given point in space.* A 
change in the vector E results in a proportional change in the vector 


®. The ratio 2 depends only on the medium. 


It is customary to characterise the electric properties of a medium 
by the dimensionless quantity &, whose value is selected so that e=1 
for vacuum. The reason for this condition is the fact that, as will 
presently be seen, no body can exist with e<1. Therefore, it is 
natural to “base” the value of e on vacuum. The quantity ¢ is called 
the permittivity and is defined by the equation 


Ee 


where e, depends on the choice of units. If the state of the medium 
does not vary from point to point, then e is also constant. At the 
boundary between two media, ¢ changes abruptly. Bodies that are 
nonhomogeneous with respect to density and other properties are 
usually nonhomogeneous with respect to permittivity as well. 

The permittivities of several substances at 18°C are as follows: 
air—1.00059, glass—7.00, paper—2-2.5 and water—80.5 : 


* The case of anisotropic media, where the vectors S and E are not parallel, 
will be considered on p. 254. 
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In engineering, the quantity D is measured in coulombs/metre? 
and the field intensity in newtons/coul. Then, e, is measured in 
coul*/metre? newton and, in these units, 


__ 107° coul? 
&o = 36x metre? newton‘ 


In the Gaussian system of units used in physics, e, is dimensionless 
and equal to i: 
E 
=7,F. 


A quantity called the electric induction D may be used instead of 
displacement. It is 4a times larger than displacement and in the Gaus- 
sian system D=eE, 

As we shall soon see, both choices for the value of e, have their 
relative advantages. The first system simplifies one group of formulas 
but complicates another, while the second system leads to the reverse 
result. 

It should be emphasised that the concepts of electric displacement 
and electric induction have exactly the same qualitative meaning. 
The difference in the numerical factor merely leads to a difference 
between the ratio of an electric induction unit to a charge density 
unit and the ratio of a displacement unit to a charge density unit. 

The electric displacement is equal to unity if the charge density 
on the Mie plates is equal to unity (see p. 224), while the electric in- 
duction is equal to unity if the charge density on the Mie plates is 


equal to re 


In electrical engineering, as a rule, only the quantity 9, i-e., 
displacement, is used. On the other hand, in physics, the electric 
induction D is used exclusively. 


Several comments are necessary regarding the formulas and units of meas- 
urement that are used in this part of the book. 

Although in mechanics and thermodynamics various choices are made for 
the fundamental quantities and various units of measurement are used, never- 
theless, the constants of proportionality are invariably dimensionless. There- 
fore, the form of the formulas in those fields of physics does not depend on the 
choice of the system of units. 

Unfortunately, however, the situation is not the same in the case of elec- 
tromagnetic fields. Two general approaches exist, i.e., one approach has been 
adopted in electrical engineering and another in physics. Not only is there a 
difference in the choice of the fundamental quantities and the units of meas- 
urement, but it turns out that we must distinguish between the constants of 
poo for one and the same formulas. Of necessity, one must become 
amiliar with the formulas in both systems. This will be done in the course of 
the presentation, but for the present it suffices to limit ourselves to several 
general comments. 
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In electrical engineering, the so-called MKSA system is widely used. Side 
by side with the metre, kilogram and second, a unit of current is taken as fun- 
damental. A current of 1 amp is defined as the current for which the constant 
of proportionality p,, i.e., the magnetic permeability of vacuum, which occurs 
in formulas for electrodynamic interaction (see p. 269), has the value 


by=4m X1077 joule/amp? metre. 


Experiments show that for such a choice of unit current 1.118 mg of silver 
is deposited per second on an electrode when a current of 1 amp passes through 
a silver nitrate solution. The historical reasons for this choice, which may ap- 
pear strange, will not be explained here. 

All units in the MKSA system may be expressed in terms of the kilogram, 
metre, second and ampere. 

Since in electrical engineering the system of units is based on four funda- 
mental quantities, there is no way of obtaining the same set of formulas in 
the CGS system, which is based on three fundamental quantities. There are, 
however, other differences between these two systems. These are expressed in 
the different choice of numerical, dimensionless coefficients. In the course of 
presentation, we shall on occasion list certain formulas in both systems, while 
in the appendix the reader will find a compilation of the electrodynamic for- 
mulas in both systems with the units of measurement indicated. 


88. Electric Field Relations 


Consider a system of electrically charged bodies forming an arbi- 
trary field. Now, describe a closed surface in this field. Some of the 
charges will fall within the surface, while others will be external 
to it. The result obtained by measuring the electric flux in the out- 
ward direction through this surface is very simple and in no way sur- 
prising. Thus, the total electric charge induced on the surface— 


which by definition is precisely the flux N=$9® cosadS—is equal 
to the total electric charge within the volume enclosed by this surface: 


GD cosadS = Yq. 


This theorem, named after Gauss and Ostrogradsky, shows that lines 
of electric flux begin on charges of one polarity and terminate on 
charges of the other polarity. Interrupted lines of force do not exist. 

Lines of electric intensity closed on themselves do not exist in 
a constant electric field.* This follows from the second law for elec- 
tric fields, which states that an electric field (more accurately: the 
vector field of electric intensity E) is a potential field. Thus, {he 
work performed in moving a charge along a closed curve is equal to 
zero in such a field, i.e., closed lines of vector E do not exist. The 


* In vacuum and homogeneous media, the E and 9 vector lines coincide, 
In this case, we can speak of the electric lines of force without indicating which 
of the vectors is being considered. 
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work performed in moving a charge from one point to another de- 
pends only on the location of these points and does not change when 
the form of the path changes. In this respect, the properties of an 
electric field are the same as those of a gravitational field. 

Let us select a reference (initial) point in an electric field and cal- 
culate potential energy with respect to this point. No matter what 
path is taken, the work A in moving a charge from the initial point 
to a given point in the field is always the same. Therefore, at this 
point the charge possesses a potential energy U that is numerically 
equal to the expended work A. 

Just as the potential energy in a gravitational field is proportional 
to the mass of a body, the potential energy in an electric field is 
proportional to the charge: 

U=qq. 
The quantity g=s i.e., the potential energy that a unit positive 
charge would possess at a given point in the field, is called the electric 
potential of the field or, simply, the potential. 

The expression for the work performed in moving a charge from 
one point of the field to another follows from the definition of poten- 
tial. Since work is equal to the change in energy, i.e, dA =—dU, 
then 

dA = Fdl = qEdl = — qdyq, 
or 
Edl = — dq, 


where dg is the change in potential. 
For a finite portion of the path 


2 
| Ed =9,—4,. 


Thus, the potential difference* is equal to the work expended in 
moving a unit charge. 
If a charge moves along a line of force, vectors need not be used. 


Then, 
{ Edl=9,-9,. 


* In a variable field, the above equation is not valid. To avoid confusion, 
2 
it is convenient to introduce a separate term for | Edl. We refer to it as the 


1 
electromotive force (emf) along the path between points 1 and 2. For con- 
stant fields, the emf and the potential difference are equal. 
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Finally, in a uniform field, the formula is simplified to 


_ Pia Pa 
eT ae 


where d is the distance between points 1 and 2. 
Formulas relating E and @ are written without constants of pro- 
portionality and have the same form in all systems of units. 


Examples. 1. Assume that two flat electrodes of area S=10 cm? are located 
in air at a distance of 5 mm apart and that the potential difference between 
them is 5,000 volts. The intensity of the created electric field is E=10°% volts/ 
metre=33 statvolts'cm (Gaussian units) and the electric displacement in the 
field of this condenser is D=e,E=9% 107% coulomb/metre®. This means that the 
charge density on the condenser plates is o=9X 107° coulomb/metre?=2.7 stat- 
coulombs ‘cm? (Gaussian units). The electric flux through the face of the elec- 
trode is N=DS=9X 107% coulomb and the charge on one plate is gq=aS=9 xX 107* 
coulomb. Thus, N=q, which agrees with the Gauss-Ostrogradsky theorem. 

2. The electric displacement of the Earth’s field close to its surface is D~ 
~9X 107" coulomb/metre?. Since the area of the Earth’s surface is S ~5x 10" 
metres? and the surface charge density is o~9X 107 coulomb/metre’, the 
Earth’s charge is g~4.5X 105 coulombs. Thus, the electric flux passing through 
the Earth’s surface is N~4.5x 10° coulombs. 


89. Field Calculations of Simple Systems 


Using the electric field relations presented in the previous article 
and from general considerations of symmetry, we can determine the 
field for certain simple systems. To determine the field means to 
calculate the electric intensity, induction or potential. It should be 
noted that knowledge of the potential suffices to characterise the 
field. If @ is known at all points in space, wecan determine the value 
of vector E by differentiating g. This becomes particularly clear 
if we construct surfaces of equal potential (equipotential surfaces) 
satisfying the equation g(x, y, z)= const. Since the work of moving 
a charge along an equipotential surface is equal to zero, the lines of 
force are directed normally to the equipotential surfaces. Thus, to 
determine the value of |E|, one must differentiate @(x, y, z) in 
the direction of the normal. This type of mathematical operation 
is considered in vector analysis. However, such differentiation is 
easily performed graphically using a curve in which @ is plotted as 
a function of the coordinates along a line of force. The tangent of 
this curve’s angle of inclination at any point is equal to the negative 
of E at that point. . 

In order to enable the reader to better understand these new con- 
cepts, we shall use examples to analyse the peculiarities of potential 
and the vector characteristics of a field. We repeat, however, that in 
principle knowledge of the potential suffices to solve the problem. 
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Point Charge. From considerations of symmetry, onecan see that the 
field of asingle, point charge is a radial, spherically symmetrical field. 

Consider a sphere of radius r. The electric flux emanating from a 
charge q is equal to 


1D cosadS=q. 


The angle a is the angle between the lines of force and the surface 
of the sphere, i.e., it is equal to 90°. At all points on the surface, 
® has the same value and may, therefore, be brought out in front 
of the integral sign. Then 





Df dS—q 
and, since f dS=4nr* (area of the sphere), the electric displacement 
at a point located at a distance r from the charge is D= aa and 


the electric induction is D=-> 
The intensity of the electric field is 


pee Nae 
Ame er? * 
In this case, the Gaussian system of units in which e =e is more 
convenient. Then E=4, and in the case of vacuum 
eae 
=7 


Since the field intensity is equal to the derivative of the negative 
potential along the line of force, i.e., 


dp 
eer 


the expression obtained for the potential of a point charge is 
q 
gat. 


The constant of integration has been assumed to be equal to zero. 
This fixes the reference potential m =0 at infinity. 

Thus, the potential of a point charge is inversely proportional to 
the first power of the distance, while the intensity is inversely pro- 
portional to the distance squared. 

If the charge is in a medium whose eiorcele constant is e, the 


intensity and the potential are reduced to+ — of the values in vacuum. 
The Earth’s potential is equal to 0.07 volt if the potential at infinity is 


taken equal to zero. In electrical engineering, the potential of the Earth is as- 
sumed to be equal to zero. 
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Systems of Point Charges. Let us consider methods of calculating 
fields created by systems of point charges. Assume ¢=1 and let us 
use the Gaussian system of units. Then, the potential for a sysiem 
of charges may be written in the form 

—~ a1 9 q és “ 
Dg on gee ae Dare 
k 
where r, is the distance from the charge q, to the point of observation. 
In the case of two charges of equal magnitude, but opposite 


sign, we obtain 1 ' 
eGo 


When the signs are the same: 
1 1 
al aay 

The form of the above formulas may turn out to be inconvenient 
for solving a given problem. Thus, it is often expedient to introduce 
a Cartesian system of coordinates and express the radius r, in terms 
of x, y, z. In the case of two charges, separated by a distance 2a, it 
is convenient to locate the origin of the coordinate system at the 
midpoint of the system, with the x-axis passing through both charges. 
Then, 
r=(x—ayt+y? +z and r=(x +a)*+y?+2. 


Sometimes it is expedient to represent the potential as a function of 
the polar coordinates R and g. From Fig. 90, one can see that 


r,=V R®-+a*—2aR cos@ and r,=VR? +a? +2aR cos @. 
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The field intensity of a system of point charges is given by the vec- 
tor equation 





bea et Ea i. — YI re 


1 ; eee . 
2 OF 2 FT 20 FF 2 £ 
ny 1 nr 2 rs 3 lp Rk 


fp: : : : : : 
Here, —* is a unit vector in the direction of radius r,. 
k 


Vector addition is used to map the lines of force. 

Universal Formula for Potential. When a field is created by volume 
and surface charges instead of point charges, the potential of the 
field may be calculated if the charge distribution is known. 

Consider the region of the volume charge to be divided into in- 
finitely small volumes dv and the area of the reeae charge to be 


divided into infinitely small Seo dS. lie =f is the volume den- 


sity of the charge and c= is the surface density, the potential 


created by a volume du is equal to & ed? and the potential created 


by a surface element dS is equal to oA Adding the potentials creat- 


ed by all the elements, we obtain: 
__ ¢ ede odS 
= S a + i pee 


The radius r is drawn from the point of observation to all the points 
in space where charges edu and odS are concentrated. 

This formula is rarely used since the charge distribution as a func- 
tion of ge and o is not usually given. In fact, the charge distribution 
is generally being sought. 

Field of a Spherical Condenser. Consider a sphere of radius r, 
having a charge +-q and surrounded by a concentric, spherical surface 
of radius rp. It is convenient to view the external sphere as grounded, 
whereupon a charge —gq is induced on its inner surface. Consid- 
erations of symmetry indicate that the field is radial. If we describe 
a sphere of radius r between the condenser spheres and apply the 
Gauss-Ostrogradsky theorem, the result obtained does not differ from 
that for a point charge, i.e., 


q 
E=-. 
The potential equation has the form 


Q=— 7 +-const, 
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but the constant in this case should not be discarded as was done 
previously. As is well known, the potential of grounded metallic 
parts is usually taken equal to zero. It will be, therefore, more con- 
venient to set @=0 at r=r, rather than at infinity. We then obtain: 
const = — on ; 

The expression for the potential in the region between the spheres 
has the form 

q_ 4 


 eoeruaer ae 


On the surface of the inner sphere, 


ga Ue, de 
rr ra 


Recalling that the ratio of the charge to the potential difference 


between the conductors of a condenser gives the capacitance, we 
obtain for the capacitance of a spherical condenser 


| gota Slee oe TAR 
“11 rR—-TA, 


ra TB 





If the radius of the outer sphere is increased (r,— oo), the expres- 
sion for the capacitance is reduced to 


C=r. 
Thus, the capacitance of a single sphere is equal to the magnitude of 


its radius. 
If the dielectric between the conductors of the condenser has a 


permittivity ©, the intensity E and the potential @ decrease tot of 


the above values. 
From the formula 


ey eee | 
jar ie. Gan. 
we obtain for. the capacitance of the condenser: 
Cz . 
‘BTA 


and for a sphere, 
C=er. 


Thus, the capacitance is e times the value obtained for vacuum. 
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The potential and field formulas being used are applicable for 
points in the region between the conductors of a condenser. They 
are not applicable to points within the first conductor or external to 
both conductors, for Gauss’s theorem yields different results for these 
points. 

If the charge of the internal sphere is concentrated on its surface, 
then for points within the sphere 


f DcosadS =0. 


Since an analogous relation is valid for every surface within the 
sphere, this requires that D=0O and, hence, the field intensity is 
also equal to zero. Thus, Gauss’s theorem shows that there is no field 
within the sphere if all the charge is distributed on its surface. Since 
E=0, the potential m remains constant and equal to the value of @ 
on the surface of the sphere. The above is illustrated in Fig. 91 by 
the curves of E and @ as functions of r. 


E ¥ 


ae ee ee ee me ee ee 


Fig. 91 


Examples. 1. The electric field intensity at the Earth’s surface is 
7,400\ 2 
(5400) =1.33 


times the intensity at a distance of 1,000 km from the surface. 
2. The Earth’s capacitance is C=6.4X 10° cm (Gaussian units)=700uf. 
3. The capacitance of condensers used in radio engineering may be as small 
. a fraction of a picofarad (1 pi=107 id) and as large as thousands of micro- 
arads. 





Field of a Uniformly Charged Sphere. It is evident that outside 
such a sphere the field isthe same as for a point chargeor a surface- 
charged sphere, i.e., 


q 
E=4, 
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where r is the distance from the centre of the sphere and g= $axa® Qe 


(g is the charge density and a is the radius of the sphere). 
To determine the field inside the sphere, consider an auxiliary 
sphere of radius r<a. The quantity of electricity inside this sphere 
is less than g, being equal to 
4 r 


3 r'e= = gq. 


According to Gauss’s theorem 


3 
Dx dar? =*, q 
i.@., 
en es 
d= nas f° 


Hence, the electric field intensity is 


eee. | 
~~ 4ste,ea® Us 


or in Gaussian units: 


E= Sr, 


fez) 


It should be noted that the field is equal to zero only at the centre 
of the sphere. Then, as shown in Fig. 92, the field increases linearly 


and becomes equal to -4 at the sur- E 
€a 


face of the sphere (r—a). Here, the 
formula for the field outside the sphere 
and the formula for the field inside 
the sphere yield the same result. From 
this radius outward, the field de- 
creases in accordance with an inverse 
square relationship. The potential out- 
side such a sphere is again given by P 


<. Inside the sphere, the value of @ 


does not interest us and will not be Fig. 92 
considered. 
Cylindrically Radial Field. Let us consider the field created by 


a uniformly charged line or cylinder having a charge +4 per unit 
length. Outside the charged region, the fields of such systems are 
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the same and have the following form: the lines of force are at right 
angles to the axis of symmetry and the flux is the same in all radial 
directions. 

In order to apply Gauss’s theorem about the cylindrical line, let 
us consider an auxiliary cylindrical surface of radius r and unit 
height. Since the flux passes only through the lateral surface of this 


cylinder, DY cosadS is equal to the integral over the lateral surface. 


Owing to symmetry (cos a=1 and D is the same at all points of 
the cylinder), 


2(as=4, 
i.e., 
4 
D x Inn = 4 and Da 
1 ~ Qrer ° 


Hence, the field intensity is 





Me 
: 1 
E= 2n8,er 
or, in Gaussian units, 
27 
E=—. 
er 


Thus, the field of the cylinder is inversely proportional to the 
distance. This formula is equally valid for the region about a charged 
line, the region outside a charged cylinder and the region between 
the conductors of a cylindrical condenser. 

Since dp =—Edr, we obtain for the potential: 


g=24in + +-const. 


The potential decreases much slower with increasing distance than 
in the case of spherical systems. Thus, for example, when the distance 
r is increased to 10 times its original value, the potential decreases 


to 3 of its value rather than to a : 


For a cylindrical condenser, where the radii of the cylinders are 
a and b, we obtain : 
24 
ee E 


2 
(s— 9a = =F (Ina—In b) = In. 
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The capacitance per unit length of such a condenser is 


Gis & 
2 In 





Example. For a coaxial cable having an outer radius a=18 mm and an inner 
radius 6==6 mm, and filled with an insulator of relative permittivity e=4.2, 
the capacitance per unit length is C=1.91 cm/em=2.12 pijcm. 


It should be noted that the formulas derived above do not take 
account of field distortion at the ends of the cylinder and hence, 
strictly speaking, are valid only for infinitely long cylinders. Prac- 
tically, however, the derived formulas are valid if the region of 
“distorted” field is significantly smaller than the undistorted radial 
field. 

Uniform Fields. Uniform fields, i.e., fields in which the lines of 
force are parallel and evenly spaced, are created by charges in planes 
of infinite extent. Naturally, the flux is perpendicular to such planes. 
The magnitude of the field is again determined by means of the Gauss- 
Ostrogradsky theorem. Thus, let us consider an auxiliary surface 
in the form of a cylinder passing through a plane of charge. If the 
lateral surface of the cylinder is perpendicular to the plane, the flux 
through the auxiliary surface is equal to the flux through the two 


end surfaces of the cylinder. The integral ¢ DcosadS is then equal 


to 2DS, where S is the area of the cylinder base. The charge within 
the cylinder is equal to oS. Hence, the formula for the displacement 
becomes 


0 
D=>z- 
The electric field intensity is E= oo and in Gaussian units E = one . 


We see that the intensity does not depend on the distance to the 
sources of the field. Let us now consider a parallel-plate condenser. 
The field inside spherical and cylindrical condensers is created only 
by the inner surface of charge, while in a parallel-plate condenser the 
field between the plates is created by both surfaces of charge. Just 
as in the case of the condensers considered above, there will be no 
field outside the condenser. Between the condenser plates D =o and 
the intensity in Gaussian units is 


pats 
& 


In writing the expression for the potential of a uniform field, let 
us reckon the distance x from one of the charged plates in the 
direction of the lines of force. Thus, in the case of a single plate, the 
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potential is written in the form g=— ox -+ const. In the case of 


a condenser, the expression for the potential between the plates 
becomes 


o= —* ox + const, 
Hence, the potential difference is 
4 4 
Pa— Po == 9 (x, —X,q) = = od, 


where d is the distance between the plates. Thus, the capacitance 
per unit area of a parallel-plate condenser is 


see i i i 08 
C=75 or, in practical units, C=-F. 


The above formulas are exact only for plates of infinite extent. 
In practice, they may be employed if the effect of the condenser edges, 
where the nonuniformity of the field is pronounced, is not great. 
We can determine the field at some point by means of the derived 
formulas only if this point is sufficiently far from the edges. More 
specifically, this condition means that the field created by elemen- 
tary charges located at the edges of the plates should be much 
less than the field created in the neighbourhood of the point under 
consideration. 


Example. Let us return to the condenser considered on p. 229. The distance 
between the plates will be doubled using two different methods: 

Method 1: The plates remain connected to a source of voltage U=5 kv. Then, 

== 1.8 pf, q,=C,\U=9x107° coul, E,=7=10° volts/metre, Di=9X 
x107~%coul/metre? and Ni=9X107°* coul. 

After doubling the distance between the plates, we obtain: C2=0.9 pf, 
92=4.5X 107° coul, E2=0.5X 10% volts/metre, S2=4.5X 107% coul/metre? and 
No=4.5X 107°® coul. Thus, half the charge entered the source. 

Method 2: Before doubling the distance, let us disconnect the plates from 
the source (condenser charge g=const). C,=0.9 pf, g=q,=9x107® coul, 
U,=L=10 kv, Ez=Z=F,, T2=D1, and Ne=Ni. Thus, the voltage on 
the plates doubled at the expense of the work of external forces. 


Field on the Surface of a Metal Object. There is no electric field 
inside a metal object. This follows from the fact that all of a con- 
ductor’s charge is located on its surface. According to Gauss’s theo- 
rem, the field is directed outwardly. 

The surface of a metal object is obviously an equipotential sur- 
face, for otherwise the electric charges would redistribute themselves 
on the surface of the conductor. It follows, therefore, that the lines of 


Sec. 89] Field Calculations of Simple Systems 239 





flux leaving the surface of the metal object must be perpendicular 
to the surface. Since all the flux leaves the surface in a single direc- 
tion, then, according to Gauss’s theorem, D=o lines emerge from 
unit surface area. In other words, the field intensity at the surface 
of the conductor is equal, in Gaussian units, to p= *9 : 

Electric Images. Let us consider the electric field created when 
a point charge is placed near a plane metallic surface. Due to elec- 
tric induction, an electric charge of opposite sign accumulates on 
the surface of the metal near the point source. The density of the in- 
duced charge is greatest directly opposite the point source and de- 
creases to zero at infinity. Similarly, for the electric field. 

Let us now consider this problem quantitatively. Since the sur- 
face of a conductor is an equipotential, the conducting surface may 
be considered grounded without in any way reducing the generality 
of the problem. Hence, the potential of the metal plate is equal 
to zero and there is no field inside the plate. We are interested in 
the electric field in the right half-space. The electrical properties of 
this half-space are uniquely determined if the magnitude of the 
charge and its distance from the equipotential plane are given. It is 
important to note that it is entirely immaterial what is located to 
the left of the zero-potential surface. It is proved rigorously in 
courses on mathematical physics that the field in a particular 
region is uniquely determined if the charge in this region and the 
boundary conditions for the potential are given. 

In Fig. 93a, the field is plotted for two charges of equal sign. Now, 
consider the space of this field to be divided into two symmetrical 
parts. The half-space of this figure is then exactly equivalent to the 


a) 





240 Electric Fields {Chap. XIV 


half-space of a charge near a metal plate (Fig. 93b) and the fields of 
such half-spaces should be identical. This is the basis for the follow- 
ing procedure, known as the method of images. We “reflect” the 
electric charge in the surface of the metal plate. In the right half- 
space, the electric field of the charge and its “image” should coincide 
with the unknown field. Thus, the unknown electric field is expressed 


by the formula 
ene(t-2), 


where r, is the distance of the observed point from the charge and r, 
is the distance of this point from the charge’s image. 

The second conclusion to be drawn is that the electric charge is 
attracted to the surface of the metal plate with the same force as to 


its electric image, i.e., the force of attraction is ae where a is the 


distance of the charge from the surface. 

Finally, this approach to the problem enables us to determine the 
distribution of the induced electric charge on the surface of the metal 
plate. This requires that we differentiate the expression for the po- 
tential in the direction normal to the surface. We obtain thereby the 
electric field intensity E, which in accordance with the formula 
given in the preceding section of this article must be multiplied by 


& . 
7 to obtain the charge. 


The method of electric images has numerous applications and enables 
us to solve electrostatic problems involving systems of nonplanar 
conductors with point charges located in the vicinity of the conduc- 
tors. 


90. Electric Energy 


Energy of a Condenser. It is easily demonstrated that a charged 
electric condenser possesses energy. Moreover, the measurement of 
the magnitude of this energy is not difficult. Thus, for example, we 
can discharge a condenser through a.conductor and measure the ther- 
mal energy released thereby. However, we need not resort to experi- 
ment to determine the factors on which the electric energy of a con- 
denser depends. The formula for this energy follows directly from 
familiar theoretical propositions. 

To simplify this discussion, let us consider a condenser in which 
one of the conductors is grounded. The process of discharging ‘the 
condenser (grounding the second conductor), which is charged to 
a potential difference m by a quantity of electricity g, may be viewed 
as the successive outflow to ground of elementary charges dg under 
the action of electric field forces. Therefore, the work performed by 
the field in this simple process is equal to ¢dg. As the discharging 
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process proceeds, the work performed in transferring each successive 
quantity of gets to ground becomes less and less, for the potential 


difference g=d is constantly decreasing. The total work performed 
by the field during condenser discharge is 


q 

q,, ¢ 
| gag=h- 
Q 


This quantity is quite understandably referred to as the electric 
energy of the condenser. Using the relationship between potential 
and charge, we obtain for the energy: 

Vo F _ 9 _ Ce 


16 OS 

Thus, for constant potential difference, the electric energy is propor- 
tional to the charge squared. A constant difference of potential is 
maintained when the condenser is connected to a constant source. 
Moreover, if the condenser conductors are insulated, the charge is 
constant. Then, the energy is proportional to the potential squared 
and directly proportional to the capacitance of the condenser. 

Energy of a Field. In the case of a parallel-plate condenser of infinite 
extent, the energy formula may be written in theform W,, — 22?" when 


2d 


using MKS units or in the form W.= when an Gaussian 


units. These formulas give the energy per unit condenser area. 

The energy formulas may be written in terms of intensity E rather 
than potential difference @. Making the substitution @=Ed, we 
obtain 





&,eE? cE? 
Vo= =) d, or eae 


2 
Thus, the energy per unit volume is 2 Let us call w= oe the 
electric energy density. 

Now, let us consider an arbitrary electric field. Assume that the 
equipotential surfaces and lines of force are plotted and that the space 
is divided into small volumes dv, each of which is bounded by two 
adjacent equipotential surfaces and a lateral surface passing through 
lines of force. Each of these volumes is like asmall volume in a par- 
allel-plate condenser and, hence, the electric energy associated with 


such an element is dW = dv. If this expression is integrated over 


the entire volume occupied by the electric field, the formula obtained 
yields the electric energy of the system creating the field. 


9—3580 
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Thus, the formula for the electric energy has the form 
E2 
W y= \ G— do. 


The significance of the above mathematical transformations goes 
beyond the formal convenience of using one or another formula. 
This new expression for the energy enables us to speak not only of 
the energy of the system creating the field, but of the energy of the 
electric field itself, and leads to the concept of a real electric field. 
In the case of constant fields, this conception can neither be con- 
firmed nor refuted. However, in the case of varying fields, we find 
direct evidence for the existence of electromagnetic fields (see p. 317). 
Hence, the derived formula for the energy of a field (energy of elec- 
tromagnetic matter) is of fundamental significance. 


Example. Let us continue with the example considered on p. 238. Before 
the plates were moved, the energy stored in the electric field of the condenser 
was W1=22.5X 107 Sjoule and the energy density w1=4.5 joules/metre*. After the 
plates are moved by the first method (voltage U=const.), the energy becomes We= 
= N= 11.95x 10~® joule and the energy density we=1.12 joules/metre® (the 
volume of the field doubled). The energy of the source increases at the expense 
of the work of external forces and a decreas in the energy of the field. After the 
plates are moved by the second method (q=const.), the energy We=2Wi= 
=45X 107° joule and the energy density does not change, i.e., w2=4.5 
joules/metre®, 


Energy of Interaction. When two oppositely charged bodies draw 
together, the work performed by the forces of the electric field is, 
naturally, at the expense of the energy of the electric field: dA = 
=—dW,,. Thus, as indicated on p. 52, the work of the electric forces 
is performed at the expense of a decrease in the potential energy 


nae, This energy is appropriately called the interaction energy of 


the charges. 

What is the relation of this formula to the formula for the elec- 
tric field energy considered above? It is evident that the interaction 
energy is a part of the electric field energy of the charges under con- 
sideration. Carefully examining the formula for the field energy, 
we note that the electric energy has definite meaning even when there 
is only one electric charge in the region. The energy of the field cre- 
ated by a single charged body is appropriately called the self-energy 
of the electric charge. We can always resolve the electric field energy 
into the self-energies of the individual electric charges and the in- 
teraction energies of these charges. 

Let us designate by E,, Es, ... the field intensity created by the 
first, second, etc., charges. The total field is equal at each point in 
space to the vector sum of the intensities: E=E£,+£,+ .... 
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The electric energy density is 
ag (E, +E, +..."= e+ GE +... +E E, + REE +... 


Clearly, the individual terms of this expansion correspond to 
the energy components discussed above. Thus, the E-squared terms 
yield the self-energies and the terms involving the product of two 
different intensities yield the interaction energies. The interaction 
energies of the charges may be positive or negative quantities. On 
the other hand, the self-energies of the charges and the total energy 
of the field must be positive. 

As arule, only the interaction energies of electric charges are in- 
volved in a problem. We can calculate, therefore, the work of elec- 
tric forces by determining the decrease in the energy of the field. 
The easier calculation is the one that should be performed. 


91. Electron Radius and the Limitations of Classical Electrodynamics 


- Let us calculate the self-energy of a spherical charge, assuming 
the electricity to be distributed on the surface. The electric field 
is then only outside the charge. Therefore, the energy of the field 
must be integrated over the region external to the sphere. If the 
charge is in a vacuum, the field intensity is expressed by the formula 


+ and the energy density at any point in the region has the form 


oe. Consider the entire region to be divided into spherical layers. 


The energy contained in such a shell, whose inner radius is r and 


outer radius r-tdr, is a& xvol. of shell. Since the volume of 


the shell is equal to4r? dr, the energy in this spherical layer is given 
by the simple expression ae dr. To determine the total energy 


of the field, this expression must be integrated from a (radius of the 
spherical charge) to infinity. Thus, 


wat ee 
= 55 “2a ° 
a 


This is the form of the energy formula for an electrically charged 
sphere, 

We shall leave it to the reader to show that if the charge is dis- 
tributed throughout the volume of the sphere the energy formula 
obtained is almost the same. The only difference is that in this case 
the formula contains a coefficient close to unity. 

g* 
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What is the result if the above formula is applied to an elementary 
particle, e.g., an electron? 

According to the principle of relativity (see p. 420), the internal 
energy of a body of mass m is given by the expression mc’, where c 
is a universal constant equal to the propagation velocity of electro- 
magnetic waves in vacuum. Equating the two energy expressions, 
we obtain the formula for the electron radius: 

Qt 
Qq= me * 
Substituting numerical values * in this interesting formula, we 
obtain a=-1.4 107"* cm. There is considerable indirect evidence in 
physics that the order of magnitude of the electron radius determined 
in this manner is quite correct. 

Nevertheless, the conception of an electron as a “usual” electric 
particle is clearly false, for we are immediately confronted with the 
problem of the forces holding the component parts of an electron so 
close together. We know that the forces of repulsion between electric 
partic separated by a distance of the order of 107’? cm is tremen- 

ous. 

Furthermore, there are other theoretical difficulties. Thus, it 
follows from the theory of relativity that an electron should be a 
mathematical point. At the same time, the electric energy of a charge 
concentrated at a point is infinitely great. 

These difficulties are typical for so-called classical physics, which 
developed in the main in the 19th century. Classical physics excel- 
lently describes the behaviour of macroscopic bodies. In fact, by the 
turn of the century many scientists believed that classical physics 
was already so perfected that there was little left to be discovered 
in physics. After the discovery of elementary particles, it was natur- 
al to try to apply the laws established for large bodies to elementary 
particles. This is when classical physics began to “fail”. We now 
know that concepts derived from observations on macroscopic sys- 
tems cannot be simply transferred to atoms, nuclei and elec- 
trons. 

The electron problem cannot be solved within the framework of 
classical conceptions. Considerable success has been achieved in elec- 
tron theory during recent years, but a complete theory does not 
exist. Therefore, the classical theory of electricity (electrodynamics) 
presented in this part of the book has certain limitations. These 
are encountered in studying the interaction of elementary particles. 
When dealing with the behaviour of a single elementary particle in 


* g=1.601X 10-" coul; m=9.1066X 10~#* gm; and c=2.99776X 10°, 
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fields created by large bodies and, of course, when considering the 
interaction of macroscopic bodies, one obtains, using classical elec- 
trodynamics, results that are in complete agreement with experi- 
mental data. 


92. Electric Forces 


In calculating interaction forces between charged bodies, one often 
uses the-concept of an electric force field. Instead of saying that 
body A exerts a certain force on body B, we introduce a force field 
and say that body A creates a field and this field acts on body B. 
As we shall see in Chapter XVI, this field is more than an abstraction. 
An electromagnetic field is a physical reality and nature implements 
the interaction transmitted from one point in space to another (“short 
range” action). By introducing the field concept, we can ignore the 
field sources and determine the forces acting on a charged body 
knowing only the field intensities where the charges of the system 
under consideration are located. 

Every charged body is a system of charges. In the case of a system 
of discrete charges, the force acting on the system is F=g,E,+ 
+Q,E,+ .... Here E,, Eo, ... are the field intensities where the 
charges are located. When theelectric charge is uniformly distributed 
throughout a volume, the force acting on the body may be repre- 


sented by the following integral: F= ( Eodv. If the electric charge 
is distributed over a surface, the force is represented by a surface 
integral: F = | EodS. 


- However, one precaution must be taken when the force is deter- 
mined directly in this manner, namely, the valueof the intensity used 
in the formulas must be the intensity existing in the absence of the 
charge on which the force is acting. In the formulas in which the force 
is expressed as a sum, the action of charge q, on itself does not en- 
ter into intensity £;, i.e., in calculating F, the field created by 4, 
is not considered, etc. The same is true for the integral formulas, 
i.e., the field intensity under the integral sign is the intensity creat- 
ed by the entire distribution of electric charge, except for the quan- 
tity of electricity located at the point under consideration. 

‘Let us illustrate this by means of the force acting on a charged met- 
al surface. As we know, the electric field intensity on the surface of 


a metal bounded by a dielectric is equal on the dielectric side to 
AO in Gaussian units, and on the metal side is equal to zero. The 
field intensity is broken on this surface. To determine the force acting 
on an element of surface, we must multiply the quantity of electric- 
ity odS by the intensity which would exist-at this location if the 
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element of charged surface under consideration were removed. There- 
fore, it would not be correct to multiply odS either by axe 
the value of the field on the dielectric side, or by zero, the value 
of the field on the metal side. It can be shown rigorously that the 
field existing at this location after removing the nett ance con- 


sideration ee equal to the arithmetic mean of 0 and x, i.e., is 
equal to ane Thus, the formula for the force acting on “ areleniGet 
of a conducting body’s charged surface has the form 


and for the entire body 
2 
F= 2x! © ds. 


The integration in the above formula must be performed over the 
entire surface, taking into consideration differences in charge density 
and dielectric constant along the metal surface. 

In the case of a uniform field (ideally, between the plates of a con- 
denser of infinite extent), the force F acting on the plate area S may 
be determined with consider~ 
able accuracy Dy the formula 


F= = 8, 


The ieadiniide of this force 
may be measured by means of a 
Thomson balance, whose method of 
operation is illustrated in Fig. 94. 


When the potential difference be- 
> = tween condenser plates of 50 cm? 
Fig. 94 area is 600 volts and the distance 


between plates is 5 mm, the force 
of attraction between the plates, calculated in the two systems of units that 
we have been using, may b: determined as follows: 





Gaussian Units MKS Units 
eS eep9S  1X1079 5xX10—8 
C=7i—5 em C27 ga “5x0 > 
g=CU=8x2=16 statcoulombs =8.9x107 "Ff 
o=0.32 statfarad/cm? g=CU =8.9x 107? x 600 = 
F = 202 g— 2X8. 14x08 x 50= =5.3x107° coul 
: o=1.06X107® coul/metre? 
= 382 dynes — % o__(1.06X10-9)?X5X1078 
~~ Deeg = 107° = 
2x! (2—) 


=32x1075 newton 


Thus, in order to balance the force of electrostatic attraction, one must place 
a 33.3 mg weight in the opposite pan. 
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It is still more difficult to determine the force acting on a body 
having a distribution of volume charge. Here, in the expression 
eEdv, the intensity E is the intensity of the field created by all the 
charges except edu. 

If the charged body is in a dielectric medium, calculation of the 
force is complicated by the fact that when we consider the charge to 
be removed it is also necessary to consider a corresponding portion 
of the dielectric removed, which means the polarised state changes 
(see below). 

If we wish to avoid the difficulties connected with “subtracting” 
the effect of the charge on itself, the force must be determined from 
the energy expression. The decrease in energy is equal to the work. 
Then, if we know the magnitude of the displacement, we can deter- 
mine the value of the force. As a rule, this is precisely the method 
used in determining the force. 

Calculation by this method of the force acting on the plate of a 


parallel plate condenser, P= s , may serve as a vivid illustra- 


tion of the above. Observing the attraction between the plates of 
the condenser (disconnected from a source of voltage), we can imme- 
diately write the expression for the change in energy when the 
plates come together by an amount A: 


SAE AE sa, 
Jt & 


Hence, the force sought is 





panes. 


& 


93. Dipole Moment of a System of Charges 


Let us return to electrical systems that may be represented as sys- 
tems of point charges. Assume that the electric field is uniform in 
the region of the system of charges under consideration. Then, the 
formula for the force acting on the system has the form 


F=(7,19,+.-.-.)E=QE, 


where Q is the total charge of the system. If a body is electrically neu- 
tral, as in the case of an atom or a molecule, the force acting onthe 
body, which contains equal quantities of positive and negative parti- 
cles, is equal to zero. Does this mean that an electrically neutral body 
does not interact with the electric field? It is easily seen that the 
answer is no. In a uniform field, the forces acting on the charges of the 
system are parallel to each other. We can determine the resultant of 
the forces acting on the positive charges and the resultant of the forces 
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acting on the negative charges. As is well known, the resultant of par- 
allel forces is exerted at the centre of “gravity” of a body. Since we 
are dealing here with the electric centre of gravity, the word “grav- 
ity” has been placed in quotation marks. Thus, all the forces acting 
on the charges of a system located in a uniform field may be reduced 
to two antiparallel forces. One of the forces is applied at the centre 
of gravity of the positive charges and 
the other at the centre of gravity of 
the negative charges (Fig. 95). If the 
system is electrically neutral, the two 
forces are equal and the total force is 
zero. However, a couple of forces of 
moment M=gElIsina will act on the 
system of charges if the centres of 
“sravity” of the positive and negative 
charges are displaced with respect to - 
each other. 

The vector p=ql, equal in magni- 
tude to the product of the positive 

Fig. 95 charge of the system and the distance 
between the centres of gravity, is 
called the dipole moment of the system. It is considered to be 
directed from the negative centre to the positive centre. The di- 
pole moment of a system determines its behaviour in a uniform 
field. Thus, a system left to itself in a uniform electric field tends 
to turn until the dipole mo- : 
ment is parallel to the direc- 
tion of the electric field (sin 
a =0). 

In a uniform field, the en- 
tire effect on a neutral system 
of electric charges is reduced 
to the moment of force M= 
=pE sin a, where p is the 
dipole moment of the system 
and is equal to the product of the quantity of electricity of one 
sign and the distance between the dipole charges. Thus, in a uni- 
form fiéld, there is no néed to consider the complex distribution 
of a particular system of charges. It suffices to replace it by the 
corresponding dipole. eo 9 ; , 

If the system is located in a nonuniform field, the dipole moment 
can no longer completely describe its behaviour. This is illustrated 
in Fig. 96 where four charges, located af the corners of a square, 
comprise an electrically neutral system having a dipole moment 
equal to zero. This is because the centres of gravity of the negative 
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and positive charges coincide. In a uniform field, neither a force 
nor a moment of force acts on such a system. In a nonuniform 
field, however, it may undergo translatory as well as rotational 
motion, since generally speaking the forces acting on the charges 
differ. By analogy with the dipole, such a system of four charges is 
called a quadrupole. Another neutral system having zero dipole mo- 
ment, called an octupole, is also shown in Fig. 96 

Of great importance in the study of the structure of matter, to 
which we shall devote a great deal of attention later, is the consider- 
ation of the interaction of 
simple electric systems. Let 
us consider several such 
systems. 

Charge-Charge. The inter- 
action between two point 
charges is, in accordance 
with Coulomb’s law, 
Ras 4s : Fig. 97 

r 





Charge-Dipole. A dipole left to itself tendstoturn until it is par- 
allel to the lines of force. After it has turned, the dipole remains 
stationary in a uniform field, while in a nonuniform field it will be 
‘drawn toward the region of greater field intensity (Fig. 97). If the 
nonuniform field is the field of a point charge, the dipole will be 
drawn toward this charge. This force of attraction is 


rea(t—rtn). 


Assuming the distance between the dipole charges to be small, and 
reducing the expression to a common denominator, we obtain the fol- 
lowing interesting formula if we neglect the quantity /* relative 
to rf and the quantity r/ relative to r’: 


2 
F=q,3 


Note that the force of interaction between the charge and the dipole 
decreases more rapidly with increasing distance than the Coulomb 
force, i.e., it-is inversely proportional to the distance cubed. 

Example. The distance between the H atom and the Cl atom in an HCl 
molecule is equal to 1.28A, and the dipole moment of the molecule is p=6X 107" 


statcoul cm. Therefore, an electron located at a distance r=10A {rom the mole- 
cule is attracted to it with a force of ~ 6X 107° dyne. 


Dipole- Dipole. Here, it is convenient to solve the problem for 
the two cases in which the dipoles are arranged as shown in Fig. 98. 
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The exact interaction formulas have the form: 





Dy es. A cee 
=a Rae for arrangement (a) 
F = Fx a for arrangement (b). 


If the distance between the charges of a dipole is small the above 
formulas may be replaced by the 


ee sppproximate expressions: 
Pat for arrangement (a) 
is for arrangement. (b). 


4) Thus, the interaction forces are in- 
o | : versely proportional to the fourth 
—— 7 . 
power of the distance. 
Fig. 98 Example. Two HCl molecules that are 
10A apart are attracted with a force 
F~1075 dyne in the case of arrangement (a) and with a force F~2X 107° dyne 


in the case of arrangement (b). 


Charge-Quadrupole. Assume that the orientation is as shown in 
Fig. 99. The interaction force 
may then be written in the 
form 


rita? - r 
P= taal oan aa): 


and the approximate formula 
for a small quadrupole is 
Fe Thus, the force is 
inversely proportional to the . 
ourth power of the distance. Fig. 99 





94. Polarisation of an Isotropic Dielectric 


As we know, when a region containing an electric field created by 
a system of charges is filled with a homogeneous dielectric, the 
field intensity and the magnitude of the electric potential are de- 


creased to 4 of their original values. On the other hand, the electric 
displacement and induction remain unchanged, and the capacitance 
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of a condenser increases to ¢ times its original value. The latter 
effect is often utilised in the measurement of dielectric constants. 
Thus, the ratio of the capacitance of a condenser containing a di- 
electric between its plates to the capacitance of the same condenser 
without dielectric may be used as a definition of dielectric constant. 

Let us now go a step further and inquire into the reasons why the 
dielectric affects the electric field. The following experiment suggests 
the explanation for this phenomenon. 

Consider a parallel-plate condenser connected to a source of vol- 
tage. The electric charge density on the condenser plates and, hence, 
the number of D-lines per unit area are uniquely determined by the 


electric field intensity, i.e., one. Let us now fill this condenser 


with a homogeneous dielectric. The relation between the electric 
field intensity and the charge density on the condenser plates is then 
expressed by the equation oa fe , i.e., the flux density (or D-lines) 
increases. In this experiment, the electric field intensity cannot change, 
for it is equal to the potential difference divided by the distance 
between the plates. Therefore, the charge density on the condenser 
plates changes, i.e., it increases to e times its original value. This 
increase may be observed experimentally.Thus, as we fill the condens- 
er with the dielectric, the voltage source adds charge’ to the con- 
denser. By measuring the electric current and the time of flow, 
“one can show that the quantity of electricity added per unit con- 
denser area is 
eE FE _e—lp 
40 40 4x ~* 


As we remove this dielectric, the additional charge returns to the 
source and the additional force lines disappear. To explain the 
additional attraction of charge to the condenser plates, one must 


assume that charges of opposite sign, having a density o =f, 


are formed on the dielectric surface next to the condenser plates. 

The surface charge of the dielectric may be explained if we assume 
that the dielectric consists of bound pairs of positive and negative 
charges that cannot move through the body, but can move relative 
to each other, forming thereby a dipole moment in each unit volume 
of the dielectric. This transformation of an electrically neutral 
system of charges into a system having a dipole moment is called 
polarisation, and the dipole moment vector of a unit volume of 
dielectric is called the polarisation vector P. 

Polarisation of a dielectric does not produce volume charge. The 
numbers of positive and negative charges per unit volume remain 
equal to each other after displacement. However, polarisation does 
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produce a charge on the dielectric surface. This is illustrated schemat- 
ically in Fig. 100. The density of this charge is equal to the value 
determined above, i.e.: 
—!1 
Spo. = E. 


In our discussion, we have considered a dielectric adjacent to the 
plate of a parallel-plate condenser. However, the situation is the 
same for a conductor surface of any shape. Moreover, it turns out 

that the above expression 


LMU ee ity TBE" curtaces perpendic: 
i i i j i j i Heo force. Thus, 
UQUUUDDDQ0000 


e—1 
o= E ‘ 
Yikes G17, iT /y UY 
















4m n 





where E, is the projection 
7, of the intensity on the nor- 
“ mal to the surface. This for- 
mula is applicable to any 
Fig. 100 real or imaginary boundary 

in a dielectric. 

A polarised charge (also often called a bound charge) may be ex- 
pressed in terms of the dipole moment of a unit volume. When a 
field is applied in the case of isotropic bodies, the displacement of 
the bound charges occurs along the electric lines of force. Therefore, 
the polarisation vector is parallel to the intensity vector. From a di- 
electric plate, let us cut out a cylindrical rod having a base S and a 
length /. Owing to polarisation, equal and opposite bound charges will 
accumulate at the ends of the cylinder. The dipole moment of the 
rod is equal, by definition, to the product of the charge oS and the 
distance / separating the charges of the dipole, i.e., p=Opo,Sl. The 
dipole moment of a unit volume is |Pi =op,,. It has been assumed 
in this calculation that the base of the cylinder is perpendicular to, 
the polarisation direction. If the base is inclined at an angle a with 
respect to this position, the charge density on the ends of the rod will 
decrease as the cosine of the inclination angle. Thus, in the general 
case,. the following relationship holds: 


S 


Opop =P» Where P,=Pcosa. 


We are now able to establish a relationship between the polari- 
sation vector and the electric field intensity vector. Combining the 
last formula with the expression for the density of bound charges 
discussed at the beginning of this article, we obtain for any direc- 
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tion n: 





e—1 
P= ow 
Thus, if the dielectric constant does not depend on the intensity, 
a linear dependency exists between the vectors P and E: 


P=0aE. 


The expression a= fot is usually called the electric susceptibil- 


ity. For water a=6.38 and for glass a —0.48. 
Since D=eE, the relationship between the vectors D, E and P 
may be expressed in the form 


D=E-+-4nP. 


When the medium is homogeneous, the vectors D, E and P are 
parallel. 


95. Polarisation of Crystal Substances 


Hitherto we have considered the behaviour of a substance that is 
characteristic of an amorphous or finely crystalline body, or of a 
monocrystal in certain special orientations relative to the field. 
However, if a plate is cut from a monocrystal at an arbitrary angle 
to the crystal faces and if the plate is then placed between the conduc- 
tors of a condenser, the following effect may be observed: The plate 
becomes polarised perpendicular as well as parallel to the lines of 
force, so that P is not parallel to E. Therefore, in this case, D is also 
not parallel to the field intensity. 

In monocrystais, the inclination direction of a free electric charge 
(E) does not coincide with the direction of the normal to the surface 
oriented in such a manner that a maximum charge (D) is induced on 
it. The relationship between D and E becomes more complex and in 
order to find D in terms of E, or vice versa, it is insufficient to merely 
know the dielectric constant. In any monocrystal, three directions 
(main axes) in which D || E may be found. If we know e for these 
three directions, the relation between D and E for any arbitrary 
orientation of the crystal in a field may then be established. How are 
the vectors D, E and P related in this case? It turns out that the 
equation D—E--4nP, introduced in the previous article for.the case 
of parallel vectors, is also valid when the vectors cease to be paral- 
lel. There is another difference between crystals and amorphous bod- 
ies in regard to their dielectric properties, namely, a relatively 
small class of bodies belonging to crystalline substances possess 
hysteretic properties. Since these properties were first discovered 
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in Seignette (Rochelle) salt, such substances are called seignettoelec- 
tric materials. Their characteristics will now be discussed. 

Place a seignettoelectric material (for simplicity assume that 
we are dealing with a powder or crystal oriented in the field so that 
D || E) between the conductors of a condenser. Let us vary the voltage 
between the condenser conductors, and hence the field intensity 


E= E, and measure the charge density o on the condenser conduc- 


tors, which for DI|E yields the magnitude of the electric induction D. 

The magnitude of the induction D increases as E increases, but it 

is not directly proportional to E, for D begins to increase less until, 

finally, saturation sets in. Clearly, 

D saturation of D corresponds to satu- 

ration of polarisation. Let us now 

begin to decrease the emf between 

the conductors. Displacement and 

polarisation begin to decrease and 

the curve follows a downward path, 

but not the same one taken during the 

period of rise. As a result, when the 

é emf is completely removed (E=0), 

the induction and polarisation in the 

dielectric are not equal to zero. The 

dielectric becomes similar to a perma- 

nent magnet. It will have “north” and 

“south” electric poles and will behave 

like a large permanent dipole. 

Fig. 101 The subsequent behaviour of seig- 

nettoelectric materials is evident 

from the hysteresis loop shown in Fig. 101. To “de-electrify” the 

dielectric, the polarity of the emf on the condenser conductors must 

first be reversed. Then, by increasing E in this new direction, we can 

depolarise the dielectric. A further increase in E again electrifies it, 

but with opposite polarity. Finally, saturation sets in again and the 
process may be repeated in the reverse direction. 

Why is this effect called hysteresis? The word is derived from the 
Greek and means “to lag”. The loop illustrated in the figure shows 
that the values of D, as well as of P and e, depend on the past state 
of the sample, i.e., on its history. 

Every crystal that does not possess a centre of symmetry in a num- 
ber of its elements of symmetry (see p. 587) possesses an interesting 
property, namely, its dimensions change upon application of an 
electric field. This phenomenon is known as electrostriction. 

Thermodynamic considerations show that if an electric field pro- 
duces a deformation, the deformation in turn will produce polarisa- 
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tion. This is known as the piezoelectric effect. Applications of the 
piezoelectric effect were briefly discussed in Part I. The relation of 
this effect to the structure of matter will be discussed on p. 657. 


96. Finite Dielectric Bodies in an Electric Field 


The following questions may arise regarding a finite nonconducting 
body located in an electric field. What forces and moments of force 
act on such a body? How is the field distorted by the presence of the 
dielectric? 

A dielectric body placed in a field becomes polarised and acquires 
acertain dipole moment. Therefore, the behaviour of such a body 
in an electric field does not, generally speaking, differ from the 
behaviour of a dipole. If the po- 
larisation vector is directed at 
an angle to the field intensi- 
ty, the orientation of the di- 
electric will be unstable. A mo- 
ment of force will act on the 
body, which will tend to turn 
the body until the vectors P and 
E are parallel. 

Thus, a dielectric body placed 
in a given uniform electric field 
assumes a definite equilibrium 
orientation that depends on the 
form of the body. Let us illus- 
trate this for the case of a di- 
electric bar. 

Experiments show that the equilibrium orientation is the one for 
which the longitudinal axis is parallel to the lines of force. Why is 
this so? Is it not true that the bar does not have fixed poles? Fig. 102 
illustrates the reason for this peculiar behaviour. The forces acting on 
the bound charges of the depicted rectangular bar may be reduced to 
four forces acting on four surfaces of the bar. We see that the forces 
acting on the longitudinal surfaces almost balance each other, while 
the forces acting on the lateral surfaces form a couple of forces that 
orients the bar parallel to the lines of force. 

If the body is in a nonuniform field, then, in addition to tle mo- 
ment of force, there will exist forces tending to pull the dielectric 
toward the region of greater field intensity. This phenomenon may be 
vividly demonstrated by making a dielectric fluid rise in a tube upon 
applying voltage to a condenser. The forces making bits of paper 
cling to a glass or ebonite rod rubbed with fur or leather are of the 
same kind as those acting on a dipole in a nonuniform field. 
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Let us now turn to the question relating to the distortion of an elec- 
tric field due to the presence of a dielectric body. First, we shall 
show that the general laws for an electric field lead to an important 
relation between the values of the electric field on either side of the 
boundary between dielectrics. 

The electric field intensity vectors at two neighbouring points 
located on opposite sides of the boundary between dielectrics having 
permittivities ¢, and e, differ in magnitude as well as in direction. 
Let us resolve these vectors into components parallel and normal 
to the boundary. It can be asserted that the field parallel to the bound- 
ary is the same on both sides. If this were not the case, i.e., if the 

field on one side were great- 
€<&2 er than the field on the oth- 
E4)E to May . er side, it would be pos- 
sible to create a perpetual 
engine by moving charges 
along the boundary against 
the field where the field is 
less and then allowing the 
charge to move on the oth- 
er side of the boundary 
(where the field is greater) 
under the action of the 
electric field forces. There- 
fore, the tangential compo- 
nents of the intensity on 
both sides of the boundary 
. must be equal: 


rEg lip E},= E;,. 


Fig. 103 We shall use the Gauss- 

Ostrogradsky law to deter- 

mine the normal components of the intensity at the boundary between 
two media. Construct an auxiliary surface in the form of an infinitely 
thin disk so that the parallel surfaces of the disk lie on opposite: 
sides of the boundary. Since there is no charge inside such a disk, 
the net outward flux through the disk is equal to zero and, there- 
‘fore, the flux through each end is the same. This requires that 
the normal components of the induction vectors be equal to each 
other, i.e., D,, =D,,. Hence, in terms of field intensities, we obtain 





8,En, = 6, En,. 


Thus, the normal components of the intensity vectors are inversely 
proportional to their permittivities. 
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Fig. 103 shows that in passing from a medium of lower permittivity 
to one of higher permittivity the flux lines are deflected away from 
the normal to the boundary. This means that the number of flux 
lines passing through a unit area increases. 

We are unable to determine the distortion produced in an electric 
field when a dielectric having a particular shape is introduced into 





this-field. This problem is difficult even when the field is uniform to 
begin with. If a body of arbitrary shape is placed in such a field, the 
field becomes nonuniform not only near the body, but inside the body 
as well. : 

Interesting exceptions are ellipsoids, a broad class of bodies inckid- 
ing spheres, flattened ellipsoids that practically do not differ from 
plates, and extended ellipsoids that are akin to cylindrical bodies. 
In mathematical physics it is shown that the field inside an ellipsoid 
is uniform as indicated in Fig. 104. Applying the law of flux re- 
fraction, we obtain the typical fields illustrated for a denser body in 
a less dense medium (€,<e,) as well as for the reverse case (¢,>€,). 
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Examples are a glass ellipsoid in air and an air bubble in glass, re- 
spectively. 

It can be shown that if a symmetrical dielectric body is immersed 
in a uniform field E, in vacuum, then E, is related to the field inside 
the dielectric as follows: 

E,;=E,—NP, 


where P is the polarisation vector and N is a coefficient depending 
only on the shape of the body. In the case of magnetic phenomena, 
it is customary to call the latter the demagnetisation coefficient 
(see p. 289). 


: , e—1 ‘ : 
Since in most cases P=-——E,, we obtain the following expres- 
4n t 
sion after simple conversion: 


fost 


jaea 4 


The dielectric constant is always greater than unity. Hence, the 
field intensity inside the dielectric is always less than the field inten- 
sity present at this location before the dielectric was introduced into 
the field. 

The coefficient N for a flat plate perpendicular to the field is equal 
to 4x. This is the maximum value for NV and the resulting decrease in 


field to + of its original value agrees with the result obtained earlier 


for a homogeneous medium. Let us take another extreme case—a 
cylinder whose axis is oriented parallel to the field. Here, N=0, 
i.e., the field does not decrease in such a body. In all other cases, 
the decrease in the field intensity depends on the dielectric constant. 


For a sphere, N== and, therefore, E; = Ee For a cylinder 
whose axis is siichited perpendicular to the field, Ne 


The field intensity E decreases because the bound ee create 
a field of opposite direction. 

As regards the induction field, the bound charges affect it only 
indirectly. Thus, the number of D-lines remains unchanged when a 
dielectric is immersed: in the field. However, due to flux refraction, 
the induction inside the dielectric increases. 


- CHAPTER XV 
MAGNETIC FIELDS 


97. Magnetic Moment 


Magnetic fields act on currents, moving charged bodies or parti- 
cles and magnetised bodies. A variety of instruments exist for deter- 
mining the properties of a magnetic field. The most convenient way 
of characterising the properties of such a field is to describe its 
mechanical action on a current circuit. It is quite feasible to con- 
struct a wire circuit of very small area, which enables us to measure 
the magnetic field quite accurately. Thus, a “test” current circuit 
plays the same role in magnetic field theory as a “test” charge does 
in electric field theory. 

Experiments with such a device lead us to the following basic 
conclusions. At each point of the field, a circuit that is free to rotate 
assumes a definite equilibrium position. The position of stable 
equilibrium is described not 
only by the orientation of the 
circuit axis in space, but by 
the orientation of a definite 
side of the circuit, eg., the 
side for which the current will 
appear to be flowing counter- 
clockwise as viewed by an 
observer on that side of the 
circuit. Let us call this side 
positive.or north and agree to 
draw the normal to the circuit 
so as to form a right-hand 
screw system with the current 
direction. Thus, the normal 
emerges on the positive (or Fig. 105 
north) side of the circuit. 

If the behaviour of current circuits is compared with that of mag- 
netic needles, one observes that the normal to a circuit in stable 
equilibrium points in the same direction as a magnetic ngedle. 
Thus, the basic definition is not contradicted if we call the direction 
of the normal to a free test circuit the direction of the magnetic 
field. 

A torque will act on a test circuit that deviates from the equilib- 
rium position (Fig. 105). Moreover, the deviation of the circuit from 
equilibrium is uniquely described by the deviation of the normal to 
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the circuit from the direction of the field. It turns out that the sine 
of angle a and the torque N are proportional to each other, i.e., 
N~sin a. Furthermore, for a particular angle a, the torque is pro- 
portional to the product of the circuit area S and the current J flow- 
ing in the circuit. Decreasing the area by a certain factor results 
i the same change in torque as a decrease in current by the same 
actor. 

‘It follows from the above that the magnetic behaviour of a circuit 
depends on the orientation of the normal to the circuit and on the 
magnitude of the product /S. This can be expressed by means of a 
single vector quantity—the so-called magnetic moment of the ring 
current. In electrical engineering, the magnetic moment is generally 
designated by the vector M=/Sn, where n is the unit normal vector. 
In the Gaussian system of units used in physics, a constant of pro- 


portionality + enters into this formula, i.e., M= S ISn, where c is the 


velocity of propagation of electromagnetic waves in vacuum. The in- 
troduction of a numerical coefficient, which is moreover dimensional, 
may appear to be an unnecessary complication. However, other formu- 
las are thereby simplified. This will become clear to the reader 
later on. 

Experiments with a test circuit show that N=BM sina, where B 
is a constant of proportionality. The volume of B varies from field 
to field and for different points in space of a particular field. This 
formula shows that B is equal to the maximum torque acting on a 
unit test circuit (M—1). We call this coefficient B, which character- 
ises the magnetic field, the magnetic induction. The vector quantity 
whose direction is that of the magnetic field and which is numeri- 
cally equal to B is known as the magnetic induction vector. 

If the torque is described by a vector directed along the axis of 
rotation (in accordance with the right-hand screw rule), the formula 
for the torque may be written in vector form as follows N=[MB)}. 

When N =0, M is parallel to B. This means that a current circuit 
tends to become so oriented in a magnetic field that the directions 
of the magnetic moment and the field coincide. The magnetic moment 
acting on a body is a maximum when it is perpendicular to the 
direction of the field. For a circuit, this means that the plane of the 
loop of wire is parallel to the flux lines. 

Having determined the magnetic field by means of a current cir- 
cuit whose magnetic moment has been calculated from measurements 
of the current and the area, we can then do the reverse, namely, use 
the formula N=[MB] to determine the magnetic moment of systems 
whose currents cannot be measured. Moreover, we can transfer the 
concept of magnetic moment to systems in which the concept of a 
ting of electric current has no meaning. This is precisely the case 
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when the physicist refers to the magnetic moment of an electron or 
nuclear particle. The magnetic moment of a magnetic needle is also 
a concept that cannot be broken down. However, after having dis- 
cussed certain special effects of the medium, we shall return on p. 266 
to the magnetic moment of a permanent magnet. In any case, the 
magnetic moment of a system located in vacuum can always be de- 
termined by the above formula for the torque. 

A body possessing a magnetic moment requires the expenditure 
of work to turn it from its equilibrium position. In the case of a body 
turned through a small anglea, the work of rotation may be expressed 
in the form 


Nda = BM sinada = — d (BM cosa). 


The deviation of a body from the equilibrium position is associat- 
ed with the accumulation of a “potential” energy U——BM cosa. 
This product is the scalar product of two vectors. Hence, U=— BM. 

In the equilibrium position, the potential energy is a minimum 
and equal to —BM. When the magnetic moment is turned through 
an angle of 90° the potential energy increases to zero. Finally, when 
the magnetic moment is directed oppositely to the field (position of 
unstable equilibrium), the potential energy is a maximum and 
equal to +BM. 


Examples. 1. The magnetic moment of the nucleus of a hydrogen atom (nu- 
clear magneton) is 0.505 1072? Gaussian unit. The magnetic moment of an 
electron (Bohr magneton) is 0.927 1072? Gaussian unit. 

2. An electric current of 1a flowing in a loop whose area is 50 cm? creates a 
maguetic moment of 5X 107% a metre?=5 Gaussian units. 

3. In the Gaussian system of units magnetic induction is measured in gausses 
while in the practical system B is measured in volt sec/metre?; 1 volt sec/metre?= 
=10* gausses. In the case of the magnetic field of the Earth, B=0.49 gauss. 

4. The magnetic induction in the air gap of a powerful electric generator 
attains a value of several thousand gausses. Academician P.L. Kapitsa has 
obtained pulsed magnetic fields in which B ~10* gausses. 


98. Ampére Force 


‘The torque acting on a current-carrying circuit is clearly the re- 
sultant of the forces exerted on every part of the conductor in which 
current flows. We can experimentally establish the relation for the 
force acting on a current element. For this purpose, it is necessary to 
isolate a part of the circuit—e.g., by means of mercury contacts. 
This part is then able to move under the action of a force. Utilising 
the tension of a sprimg to counterbalance the displacement, one can 
measure the magnetic force (Fig. 106). 
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Ampeére first established the relation for the force acting on a cur- 
rent element of small length. This relation has the following form: 


I : I ; ae 
dF =~ (dl, Bj, i.e., dF =--dlxBsindl, B. 


The vector notation here is suggestive of the familiar left-hand 
rule. The force acting! on an element of wire length is always perpen- 
dicular to the plane 
passing through the 
current and the mag- 
netic induction vector 
at this location. To 
determine the sense 
of the force, note from 
which side the rota- 
tion of vector dl to- 
ward vector B, through 
the smallest an- 
gle, appears counter- 
clockwise. This is the 
positive side in a 
right-hand screw sys- 
tem and the force 
vector then points to- 
ward the observer. 
The force has a maxi- 

Fig. 108 mum value when the 

current element is 

perpendicular to the vector field. When the wire element is paral- 
lel to the flux lines, the force is equal to zero. 

The above formulas are in the form used in physics, i.e., valid 
for the Gaussian system of units. In the form used in electrical 


engineering, the coefficient 4 — is absent and the formula for the Am- 
pére force is 
dF =! (dl, B). 


To determine the magnitude of the force acting on a piece of wire 
of finite length, one must integrate the above expression for the force: 


F=- {dl, Bl. 


c 























In the simple case of a rectilinear piece of wire of length /, located 
in a uniform magnetic field B, Ampére’s law may be directly applied 


in the form 
F=+ T1B sin ij B. 
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A perfectly natural relationship exists between Ampére’s law and 
the torque expression derived in the preceding article. We shall 
consider only the simple case of a rectangular loop oriented parallel 
to the flux lines in a uniform magnetic field (Fig. 107). Two sides of 
the loop are perpendicu- I 
lar and the other two 
sides are parallel to the 
flux lines. Therefore, the 
forces acting on the wire 
elements may be reduced 
to the two shown in Fig. 
107. These forces are 
equal and in accordance 
with Ampére’s law may 
be written in the form Fig. 107 
F=IIB. As can be seen 
from the figure, the Ampére forces create a torque N=//Bd. But 
since ld=S is the area of the loop, we obtain N=/SB =MB, 
which is the same as the formula for the torque derived in the 
preceding article. We leave it to the reader to derive a more gener- 
al proof. 





Example. The force acting on a conductor whose length is 3 metres and through 
which a current of 50 a flows in a field of 3,000 gausses=0.3 volt sec/metre? is 
F=B/l=0.3X 50X 3=45 newtons ~4.5 kg. In the case of arotor diameter of ~1 
metre, the torque acting on the loop is ~45 newton metres. These values are of the 
order of magnitude of the parameters of a large electric motor. In an electrical 
measuring instrument, a force F=2X 107° newton=2 dynes~2 mg acts on a 
conductor of length 2 cm through which a current of 0.01 a flows in a field of 
100 gausses. For a loop diameter of ~ 1 cm, a torque of ~2X 107? newton metre 
acts on the loop. 


99. Force Acting on a Moving Charge 


We may go a step further and consider the magnetic forces acting 
on currents as forces applied to elementary particles of electric- 
ity. 

Electric current is simply a flow of electric charge. If e is the par- 
ticle charge, v the particle velocity and n the particle concentration 
(i.e., the number per unit volume), then the expression for the cur- 
rent intensity may be expressed in the form / =nevS. Thus, all the 
particles in a volume vS pass in | sec through the wire cross-sectiomS, 
i.e., the quantity of electricity flowing is equal to nevS (Fig. 108). 
acy nile this expression in the formula for Ampére’s law, we 
obtain: 


dF= < |B] nSdl. 
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But nSdi is the number of particles in the conductor volume under 
consideration. Thus, the force acting on one particle is 


== [vBl. 


This force is sometimes called the Lorentz force—in honour of Lo- 
rentz, the distinguished physicist who contributed so much toward 
the development of the theory of electrons. 


ae: :——_¢-4 


Uv 









an 


Fig. 108 


The above expression for the force (we shall restrict ourselves 
to the form used in physics, i.e., with the coefficient & immediately 


provides the answer to a number of very interesting actions regard- 
ing the nature of motion of electric particles (e.g., electrons and 
protons) in a magnetic field. The force acting on a moving particle 
is perpendicular to the flux lines and to the particle velocity vector. 
If a particle moves parallel to the flux lines, no force is exerted on 
it. On the other hand, the force is a maximum when the motion oc- 
curs in a plane that is perpendicular to the flux lines. In this case, 


we obtain f= evB. 


If the field is uniform, an electric particle mowing perpendicular 
to the field will describe a circle, for according to the fundamental 
law of mechanics this is the nature of the motion under the action 
of a constant force directed at right angles to the motion. We shall 
return to the problem of particle motion in a magnetic field later 
(p. 443). 


Example. Electrons in a cathode-ray tube ‘accelerated by a potential 
difference of 70 volts acquire a velocity of 5X 108 cm/sec. Upon entering a mag- 
netic field of 500 gausses at tent angles to the field, each electron experiences a 


Lorentz deflecting force of f=tevB= 4x107" dyne. Under the action of this 
force, the electron begins to move ina circular orbit of radius R, where R is 
determined from the relation f=7- Hence, R=5.6 cm. 


Sec. 100] Magnetic Fields Created by Permanent Mag nets 265 


100. Magnetic Fields Created by Permanent Magnets 


Every permanerit magnet has two poles.* The flux lines are di- 
rected outwardly at the north pole and inwardly at the south pole. 
Imagine a surface constructed so that it encloses the north pole. 
We can then determine the total number of lines passing outwardly 
through this surface. By analogy with the corresponding electric 
quantity, this number is called the magnetic flux and is designated by 
the letter @. The flux through an elemental area perpendicular 
to the flux lines is equal to d®—BdS,. Thus, through any arbitrary 
area, dD=BdS cosa, where a is the angle formed with the flux 
lines by the normal to the area; and through the surface S, D = 


= \B cosadS. Finally, through the closed surface, D = GB cosa dS. 


The flux M,, directed outwardly at the north pole and inwardly 
at the south pole, is the fundamental characteristic of a magnet. 
The stronger the magnet, the greater Dy. This somewhat justifies 
the designation “quantity of magnetism’—which is only of histori- 
cal significance—for the quantity proportional to the flux, namely, 


m= Lo. Sometimes m is called the magnetic mass, but this term is 


even less appropriate. In electrical engineering units, m=. 

If the poles of a magnet are small (e.g., in the case of a magnetic 
needle), the flux lines close to the poles are directed radially. 
. Using the Gauss-Ostrogradsky theorem, 


§ Deos adS = 4nq, 


we derived the formula for the electric induction of an isolated charge, 
namely, D=+4 . Clearly, an “isolated” magnetic pole should yield 
a magnetic induction satisfying the analogous equation: 


B=", since GB cosadS = 41m. 


In the case of the electrical engineering formulas: 


= Fee since § BcosadS =m. 

" To be sure, “isolated” magnetic poles do not exist. The above for- 
mula has meaning only for long magnets having point poles, and 
then only close to the poles. Nevertheless, this method of dealing 


* The creation of magnets with any number of pairs: of poles is also con- 
ceivable. 
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with the pole of a permanent magnet is fully justified in practice. 
This can be clearly demonstrated by means of the expression for the 
field of a bar magnet considered as a magnetic dipole whose two 


AY “Vay 


sy 


Fig, 109 


poles m are separated by a distance /. Fig. 109 shows the field of a 
bar magnet and the ideal field based on the formula 


where r, and r, are the distances from the poles to the point under 


+ 


Bp 
Fig. 110 


consideration. The fields are seen to be very 
similar. 

Calculations yield good results for the field 
at large distances from the magnet. Thus, if the 
distances r, and rg are large relative to the mag- 
net length /, the distance between the poles of 
the magnetic dipole, we are fully justified in 
considering the poles as points. The calculations 
are exactly the same as the corresponding calcu- 
lations for electric interactions. Let us compare, 
for example, the values of the magnetic induc- 
tion created by a bar magnet at a distant point 
along the magnet axis and at a distant point 
perpendicular to the axis. In the first case, we 
obtain 


Bas eM 
™ 72 (+)? rs 8? 


where M =ml is called the magnetic moment of 


the permanent magnet. In the second case (Fig. 110), 


m M 
B=2, cosw=5, 
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Thus, the field along the axis is twice as large. 
In the electrical engineering system of units, the last two formulas 
assume, respectively, the following form: 


M M 
B= ont and B= igri 


Example. Let us calculate the magnetic induction created by a bar magnet 
of length [=10 cm at a distance r=1 metre from the magnet, measured along 
the axis. The magnet cross-section is S=3 cm? and the induction in the magnet 
is 500 gausses. 

The magnetic flux in the magnet, which is the same as the outward flux from 
the pole, is ®=500X 31,500 maxwells. Thus, a “magnetic mass” m = ++200_- 
= 120 Gaussian units is concentrated at the magnet pole. The magnetic moment 
of the magnet is 


M=ml=120X 10=1,200 ergs/gauss (Gaussian units). 


Then, for the magnetic induction, we obtain 
Bua 2M _ 21,200 


—"r? ~~ (10098 


=2.4x107® gauss. 


101. Magnetic Field Intensity 


Let us consider the interaction of an isolated magnetic pole and a 
current element (Fig. 111). The magnetic pole creates a field B at 
the location of the electric current. Therefore, in accordance with 
Ampére’s law, the force acting on the current element is 


dF=+1 (dl, Bl. 


In place of the magnetic induction, we can substitute the expression 
for a point pole. Since the field is direct- 
ed along the radius, we obtain the fol- 
lowing expression for the interaction 
force: 
poy [ae, <| 
cr r 

or 


dF = dtxsin dlr. 
cr 


It is quite natural to assume that the 
force with which a current element acts 
on a magnetic pole is represented by the 
same formula, except that the direction Fig. 111 
of the force is reversed. This assumption 
cannot be directly verified experimentally since an isolated pole and 
an isolated element of constant current do not exist. However, we 
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can verify the validity of the above statement by integrating the in- 
teraction forces in actual cases. It turns out that theory and experi- 
ment agree. 

Thus, the force exerted by a current element on a magnetic pole 
may be written in the form 


dF="1 [ as, +| 


or, in the electrical engineering system, i.e., without the coefficient 
a and replacing m by in: 


m r 
dF=@,1|dt, 2). 
A minus sign does not appear in this formula because a reversed 
radius vector has been assumed. The direction of r is always taken 
as the direction from the field source to the point of observation. 
Therefore, in the case of the force acting on the current, r was as- 
sumed to be directed from the pole to the current element. Now, when 
the force is exerted by the current on the pole, the radius vector r 
is assumed to be directed from the current element to the pole. 
The force acting on a unit magnetic pole is called the magnetic 
field intensity: 


dna se 
m 


Thus, our discussion has shown that the magnetic field intensity 
created by a current element is given by the formula: 


dH=S, [at <]. 


r 


In the electrical engineering system of units, this formula for the 
magnetic field intensity created by a current has the form 


_ at, 4] 


Thus, a magnetic field may be characterised in two different ways, 
namely, by the induction vector and the intensity vector. The former 
measures .the action of the magnetic field on a current and the latter 
measures the action of the field on a magnet. 

In practice, it is easier to reduce the measurement of intensity to 
the measurement of the torque acting on a magnetic needle (Fig. 112). 


dH= 
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Such a needle located in a uniform field is subject to the action of a 
couple of forces, where the magnitude of the force is equal to mH 
and the arm of the couple is equal to / sin a. Hence, for the torque, 
we obtain the expression 


N=MH sina. 


In vectorial form N=[MH], where M=ml is the magnetic mo- 
ment of the needle, It is seen that this formula is very similar to that 
for the torque acting on a cur- 
rent-carrying circuit. 

The relationship existing be- 
tween the magnetic field inten- 
sity and the magnetic induction 
can be determined experimental- 
ly. It turns out that in all cases, 
except in the case of anisotropic 
bodies, the intensity and induc- 
tion vectors are parallel to each 
other. This means that the mag- 
netic needle and the axis of the f- 
test circuit are always parallel. 
Moreover, in all cases, except Fig. 112 
in the case of ferromagnetic sub- 
stances, a simple linear relationship exists between H and B, namely 
B=p,uH, where uw, is a universal constant (the so-called magnetic 
permeability of vacuum) and p is a coefficient characterising the 
médium (the relative magnetic permeability of the medium). 

In the system of units used in physics, p,=1. This yields 
the same dimensions for the magnetic induction and the intensity. 
The price that must be paid for this identity, however, is 


the introduction of the dimensional coefficient - in the formula 


for Ampére’s law. In the system of units used in electrical engineering, 
the magnetic permeability of vacuum is 





Bw, =4nx 107" joule/a’ metre. 


102. Interactions of Currents and Magnets 


The relations considered in the preceding sections enable us, fn 
principle, to calculate the interactions of any magnetic system. 
We have at our disposal formulas for the forces and torques acting 
on devices by a magnetic field of any origin: 
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Action on a current 
Action om a magnet 


Physics formulas Elec, eng. formulas 
F=- [dl, By F=I{dl, B] F=mi 
N= (MB), N=[MB], N=[MH], 
where M= t IS where M=IS where M=ml 


Formulas relating fields to their sources: 








Field due to current Field due to magnet 
Elec. eng. Physics Elec. eng. 
Physics formulas formulas formulas formulas 
I r I r m m 
dH=-a[a +] | dH=ga la 7] BS B= ae 
2M M 
B=ypH B=ppH B= B=55 


Substituting any of the lower formulas in any of the upper ones and 
using the relation B=p,wH, we obtain the formulas for magnetic, 
electromagnetic, magnetoelectric and electrodynamic interaction. 
We shall illustrate each type of interaction by an example. 

Magnetic interaction, i.e., the action of one magnet on another, 
Two poles separated by a distance r interact in accordance with 
Coulomb’s law, i.e., 

mm, M,Mz 
F= ye OF Pa 





The interaction force is inversely proportional to the magnetic per- 
meability. 

Electromagnetic action, i.e., the action of a current on a magnet, 
A current element exerts a torque on a magnetic needle. For simplic- 
ity, we assume that M | H, i.e., the magnetic needle is perpendi- 
cular to the flux lines. Then, 


dN = "4 dt sin dtr or 


dN = alsin dl; r. 
Ter 


The interaction does not depend on the magnetic permeability, 
i.e., on the properties of the medium. 
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Magnetoelectric action, i.e., the action of a magnet.on a current. 
Consider a current-carrying circuit located along the extension of 
the bar magnet axis at a dis- 
tance r from the magnet (Fig.113). 
The torque acting on the cir- 
cuit is 


2MourMnar . 
N=M,,,B sina = —“"% sin a 
or 








Thus, the interaction does not 
depend on the magnetic permea- of 
bility. 

Example. A circuit of area S= 
=20 cm’, through which a current B 


J=10 a flows, interacts at a distance Fig. 113 
of 100 cm with a bar magnet whose 
magnetic moment is M nag= 1,000 Gaussian units. The torque acting on the 
circuit is 
2M eurM mag 
fad 7 


N= 


Mear= agro X 10X20X3X 10° = 20 Gaussian units and 
N=4x107? dyne cm. 


Electrodynamic action, i.e., the action of one current on another 
current. Two parallel currents are attracted with a force 


i A 
dF =~ dl,B, where B=ppH =p} dl,, 
ie., 
dF ap lata 


cr? 


1 ,I,dl,dl, 
4nr? 


or dF=p,p 


The interaction is directly proportional to the magnetic perme- 
ability. 

The formulas for the interaction of magnetic systems may be de- 
rived in exactly the same manner. 


Example. \t is essential to take into account electrodynamic interaction jin 
the laying of bus bars. If a short circuit should occur, the bus bars and their 
supporting insulators must be sufficiently firm to withstand large electrody- 
namic forces. Assume that a current /1=/2=3%X 10* a flows in parallel bus bars 
separated by a distance d=20 cm. A force F=B/=wp,H/ acts on a unit length 


of each bus bar, where H= is the magnetic field intensity created by the 
linear current flowing in the other bus bar (see p. 276). 
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Thus,. 


pol? — 4rX10—-7X9X10% | 
F-= a oe = 900 newtons, 


i.e., a force of ~ 90 kg acts on each metre of bus bar. The same result could 
be obtained by integrating the last formula for dF above. 


103. Equivalence of Currents and Magnets 


We have called attention to the fact that a similarity exists be- 
tween the expressions for the torque acting on a magnetic needle and 
the torque acting on a current-carrying circuit. As a matter of fact, 
these two systems behave extremely alike in an external field. If 
each of the systems is characterised by its magnetic moment vector, 
the likeness appears even greater. Both systems tend to become ori- 
ented in the magnetic field with the magnetic moments parallel to 
the flux lines of the field. If the magnetic moment is displaced from 
the position of stable equilibrium, a torque N=[M#Hj acts on the 
system in the case of the magnetic needle and a torque N=[MB] 
in the case of a current-carrying circuit. The potential energies of 
these systems are represented by the formulas U=—MAH and Us 
=— MB, respectively. 

If we recall thal B —p,yH, the difference between the two formulas 
becomes immediately evident, i.e., they differ only with respect 
to the magnetic permeability factor. Hence, as regards mechanical 
action, a magnetic needle of moment M is equivalent to a current- 


carrying circuit of moment M,= 7 : 


However, the analogy between these two systems goes even fur- 
ther. We shall now show that the proper fields of a magnetic needle 
and a current-carrying circuit are alike except for a constant factor. 
This similarity occurs at distances considerably greater than the di- 
mensions of the system. We shall prove this for a point in space in 
line with the magnetic moment at a distance r from the centre of 
the system. The field of a magnet for sin a point has already been 


calculated and found to be equal to B=" —;. It remains to determine 


the field of a circular loop of current Abie the axis of. the loop. 

In Fig. 114 are shown the vectors of the field intensities due to two 
arc elements of current directed into and out of the page, respectively. 
The field intensity vectors are perpendicular to their corresponding 
current elements and radius vectors, i.e., they are in the plane of the 
page. The sense of the intensity vectors. is determined from the vector 
product rule or, what amounts to the same, the right-hand screw 
rule. | 

Since a current element and its radius vector are perpendicular td 
each other, the field created by such an element is equal in this case 
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to aH =. Now, let us add the two vectors depicted in the figtire. 
For the field created by the “antipodes”, we obtain 
dH = a cos f, 
cr 


where the meaning of the various designations is evident from the 
figure. This value of field is yielded by every such pair of “antipodes”. 
Therefore, the resultant field is obtained from the last expression by 
replacing dl, the length of an element, by xa, half the circumference 
of a circle. Thus, the field in- 
tensity along the axis of a cir- 
cular current, at a distance r 
from the current,* is given by 
the formula 


H= 


Qa? 
cr® 





Since 4 TS is the moment of a 


circular current (here S =xa’), 
we obtain: Ha and B = 
2M Fig. 114 

=Phy Sr ‘i 

. Thus, we have proved that a magnetic dipole and a current-carry- 
ing circuit are not only equivalent as regards the forces acting on 
them, but also as regards the fields created by them. The nature of 
the equivalence is the same in both cases. A magnetic needle of mo- 
ment M can be replaced by a current-carrying circuit of moment 


_M 
C Bity * 








In vacuum and for the system of units used in physics, i.e., for 
uy, =1, the equivalence principle becomes even simpler. In this case, 
a magnetic needle of moment M is equivalent to a current-carrying 
circuit having the same moment. 


Examples. 1. Let us return to the example on p. 267 and calculate the mag- 
netic induction of the same magnet in the practical system of units: 
B=0.05 volt sec/metre?, S=3X 1074 metre?, 
@= 15 107° volt sec, m=15X 107° volt sec, and 
1=0.1 metre, and M=ml=15X 107~°X0.1=1.5X 107° volt sec metre. 


* Since we are dealing with large distances, the difference between r and 
the distance to the centre of the system is negligible. 


10—3580 
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ee ee 
Hence, 
B= =2.4x 1077 volt sec/m?, 
Qar’ 


which is in complete agreement with the result obtained on p. 267. 
2. Consider a current-carrying circuit for which /=5 a and S=2 cm*. The 
magnetic field intensity created at a distance r=50 cm (along the axis of the 


circuit and perpendicular to its plane) is H=22 : 


1 1 
M=-~ IS=3y59% 


and H=1.6x1075 oersted. 





5x3x10°x2=1 erg/gauss 


104. Rotational Nature of a Magnetic Field 


A study of the nature of magnetic lines shows that magnetic lines 
differ basically from electric fields. Electric lines have a beginning 
and an end, i.e., there are no closed lines in a constant electric field. 
On the other hand, experiments show that magnetic flux lines, i.e., 
vector lines of magnetic induction, are always closed. In other words, 
such lines have neither beginning nor end. 

For reasons discussed above, force fields in which the work along 
a closed path is equal to zero- are known as potential fields. Vector 
fields characterised by closed flux lines are known as rotational 
fields. A magnetic field is a rotational field. 

If we describe a closed surface in a magnetic field, the net outward 


* flux D= GB cosa dS through such a surface will always be equal 
to zero. In other words, the number of lines entering this surface 
is equal to the number of lines leaving it. Thus, the equation 
GB cosadS=0 is the mathematical expression of the fact that 
magnetic flux lines have neither beginning nor end. 

The magnetic lines always encircle the currents creating the field. 
Therefore, the integrals taken along induction or intensity flux 
lines, ie., 6 Bal and ¢ Hal, respectively, differ from zero. It is more 


convenient to consider the second integral, for its value is propor- 
tional to the magnitude of the electric current encircled by flux lines. 
This can be seen from the basic field intensity formula, which shows 
that H and current strength are directly proportional to each other. 


By analogy with electrostatics, | Hal is called the magnetomotive 
force (mmf). If the integral is taken along a flux line, then 


§ wat = Hal. 
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The magnetomotive force along a closed curve is proportional to the 
current encircled by this curve: 


f Hdl-= kl, 


where & is a coefficient of proportionality. 
A flux line may encircle more than a single current. Then, using 
the algebraic sum of the currents, the equation assumes the form 


Gf Hdl =k SI. 


Deeper theoretical analysis, which we are unable to go into here, 
shows that the above equation is subject to two more generalisations. 
First, the integral need not be taken along a flux line, but can be ta- 
ken along any arbitrary circuit. Secondly, the coefficient of propor- 
tionality in the equation is a constant depending only on the proper- 
ties of the medium and is the same for all geometric conditions. 
Thus, the magnetomotive force is the same for any closed curve that 
encircles a current of specified strength. The shape of the curve 
and its length are of no significance. It is immaterial whether the curve 
encircles one current or ten currents and whether these currents are 
rectilinear or curved. As long as the algebraic sum of the currents 
passing through the closed curve remains the same, so does the mag- 
netomotive force. 

Since the coefficient of proportionality in the formula for the mag- 
netomotive force is a universal constant, we can determine k if the 
magnetomotive force can be calculated for any system whose field is 
known. . 

We are familiar with the general expression for the magnetic field 
intensity of a current element, but mathematical difficulties are en- 
countered in calculating the magnetomotive force by means of the 
formula for the field intensity, namely, 


I r 
dH=, [al, 2]. 
However, we are also familiar with the formula for the magnetic 


field intensity along the axis of acircular current, namely, H = - . 


No special difficulties are encountered in calculating the magnetomo- 
tive force along this axis. We should not be disturbed by the fact that 
the integration is carried out along a straight line, while we are in- 
terested in the magnetomotive force along a closed curve. As a mat- 
ter of fact, a straight line extending from minus infinity to plus in- 
finity is a closed curve, for it is closed at infinity. The expression for 


the magnetomotive force, \ Hal, along such a closed curve, i.e., 





fo* 
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along the axis of a circular current from minus infinity to plus in- 
finity, may be written in the form 





dl 
2M | ra 
where a is the radius and / is the distance measured along the axis 
of the circuit. The integral is easily evaluated by using the new var- 
lab B, defined by me formula £ cot B. It turns out to be equal 
to = = . Substituting 4 Ina’ for M and equating the value of the 
agnictonislive force te kI, we obtain 


Rk -* (in the Gaussian system of units) 
k=1 (in the electrical engineering system of units). 
The magnetomotive force relation now assumes the form 


GHdt=*S1 o $Hdt=Y 1. 


The magnetomotive force relation is very useful in determining 
the magnetic fields of various systems. Its application is facilitated 
by considerations of symmetry and in this respect the following dis- 
cussion is quite analogous to the discussion relating to the solution 
of the corresponding problems in electrostatics by means of the 
Gauss-Ostrogradsky theorem. 

Let us first consider an infinitely long rectilinear current. From 
considerations of symmetry it is evident that the flux lines must be 
circles whose centres lie on the wire axis. It is similarly evident that 
at all points on such a circle the numerical value of the intensity is 
the same. Applying the magnetomotive force relation to such a flux 


line, we obtain: Hf d=*% /. Here, gal is simply the length of a 
flux line. If the points under consideration are located at a distance 
r from the wire axis, then Gg di= 2nr. Thus, for the magnetic field of 


an infinitely long rectilinear current in the region outside the wire, 
we obtain 


H=2 (in the Gaussian system of units) 
Hs (in the electrical engineering system of units). 


Let us now determine the magnetic field intensity inside the wire. 
Assume that the radius of the wire is designated by a and that the 
current density is uniform over the wire cross-section. Hence, the 
flux lines within the wire must also be circular. Consider such a flux 
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line of radius r. The current flowing through it will be Cl, There- 
fore, the magnetomotive force relation yields: 


4x 
Cc 


Hx 2ar= “21, and H=2 41, 


a 
In the electrical engineering system of units 


Pi I 


= longi 


Thus, we see that the magnetic field intensity along the wire axis is 
equal to zero. The intensity increases with radius and becomes a 
maximum at the surface of the wire. Then, for the region outside 
the wire, the field intensity decreases, being inversely proportional 
to the distance from the axis (Fig. 115). 

If the field is determined at a point for which the distance r is much 
less than the distance to the end of a wire, then the formula Hae 
is also valid for a wire of finite dimensions. 


Example, Let us calculate the magnetic field intensity at a distance of 5 cm 
from the axis of a rectilinear current of 
20 a. H 
In Gaussian units (J=20X3xX 10°= 
=6% 10! statamps): 


2l 26 x 1010 
H== =Fx1owxs =0.8 oersted. 





In practical units (/=20 a and 
r=0.05 metre): 
I 20 amps 
H= oe oao nO metre 

Another important example of 
the application of the magnetomo- 
tive force relation is in the calcula- Fig. 115 
tion of the field of a solenoid. 

Consider a uniformly wound toroidal solenoid whose circumieren- 
tial length is L. The field within the solenoid is uniform and all the 
flux lines are concentric with L. This system plays the same role in 
magnetic field theory as a parallel-plate condenser of infinite extent 
in electric field theory. Each flux line envelops all n turns. Therefore, 
the magnetomotive force along a flux line of length L is 


G§ Hdl=*" nl. 
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Since f Hal =HL, we obtain 


H =F I (Gaussian system) 


H=FI (electrical engineering system). 


The magnetic field intensity of the coil is determined by its “ampere- 
turns”, i.e., the product of the current strength and the number of 
turns per unit length of the solenoid. It should be noted that the sim- 
plicity of the last formula is one of the justifications for the electrical 
engineering system of expressing the field equations. Since a solenoid 
is one of the basic elements of electrical engineering devices, simpli- 
fication of the formula for the calculation of its magnetic field intens- 
ity is of great practical significance. 


The formula H=+ / is also valid for a straight solenoid if used to 


determine the field within the solenoid at points sufficiently far away 
from the edges. 

Example. The magnetic field intensity at the centre of a long, thin solenoid, 
where L=15 cm, n=1,500 turns and /=0.1 a, is 


H = 1,000 2785 


metre ° 





In the Gaussian system: . 
Hat pat  (0.1x3x 10%) = 12.56 oersted; 








e L *~ 3x10 
amp ee -3 amps 
1 pe AMX 10 oersted and 1 oersted = 80 a atre 


105. Law of Electromagnetic Induction and Lorentz Force 


The phenomenon of electromagnetic induction discovered by 
Faraday, the great English physicist, may be described as follows: 
An electric current is induced in a closed conductor loop if the value 
of the magnetic flux passing through the loop changes. Moreover, the 
inducted emf is proportional to the rate of change of the magnetic 
flux, i.e., the derivative with respect to time 


ae where O= | Beosads. 

We shall show that the law of electromagnetic induction is closely 
related to the existence of a Lorentz force. If the electromagnetic in- 
duction is due to the displacement of a wire in a magnetic field, then 
ae law of induction follows from the expression for the Lorentz 

orce. 

In order to avoid confusion due to difficulties of a purely mathemat- 
ical nature, let us simplify the proof by assuming that the induced 
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emf afises in a rectangular circuit oriented perpendicular to the 
flux lines in a uniform magnetic field. A change in flux produces a 
translatory displacement of one side of the rectangle of length / 
as shown in Fig. 116. Since there are free charges in the displaced 
conductor, these charges are subjected to the action of a Lorentz 


force f = = vB when the conductor moves with a velocity v. In view 


of the fact that the velocity, magnetic field and conductor are at 
tight angles to each other, we are 
able to dispense with the vector 
notation in the formula for the 
force since the sine of the angle is 
equal to unity. The Lorentz force 
is directed perpendicular to’ the 
plane passing through the direction 
of the velocity v of the charges, 
and hence of the wire, and the di- 
rection of the magnetic flux lines. 
Thus, the force is directed along 
the wire. The charges are impelled 
to move and an induced current is 
thereby created. 
The electromotive force is equal Fig. 116 
to the work of moving a_ unit 
charge around a closed circuit. Since the force acting on a unit 


charge is equal to 0B, the work of this force along the moving 





wire is equal to + oBl. Moreover, no work is performed along the rest 


of the circuit. Hence, the last expression is the expression sought for 
the induced emf. It has the form: 


gue =+ Bl (in the Gaussian system of units) 


6'"4= vBl (in the electrical engineering system). 


Assume that the wire moves a distance dx during the time dt. 
Thus, the area of the circuit increases by the amount /dx =dS and the 
magnetic flux by the amount dD=BdS. Since v =% the induced 
emf may also be written in the following form: LAs. But this is 
precisely the expression for Faraday’s law of electromagnetic induc- 
tion, ie, G"4= —& in the Gaussian system of units and 


G4 2 in the electrical engineering system. 
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We have thus demonstrated that electromagnetic induction and 
the deflection of moving electric charges in 4n external field represent 
different aspects of one and the same law of nature. In the next chap- 
ter, we shall return to this interesting problem. Here, it was only 
hecessary to present the essence of the electromagnetic induction law. 


106. Measurement of Magnetic Fields by Means of Induced Impulses 


The phenomenon of electromagnetic induction is utilised in the 
design of precision instruments for the measurement of magnetic 
fields. Assume that it is necessary to determine the value of the 
magnetic field at some point in space. A 
small, flat coil or a single turn of wire is 
placed in a magnetic field perpendicular to 
the flux lines and the ends of the winding 
are connected by means of leads to the 
terminals of a ballistic galvanometer. Now, 
if the coil is rapidly turned through a 90° 
angle in such a manner that its flat sur- 
face becomes parallel to the flow lines, 
an electrically induced current flows in the 
winding as the coil is being turned. The 
brief flow of current, which rapidly reaches 
a maximum and then drops to Zero, is 
called an induced impulse (Fig. 117). During this brief interval, 
a certain quantity of electricity flows in the wire. The charge can be 
very accurately measured by means of a ballistic galvanometer, a 
device having a moving coil with a high moment of inertia that in- 
tegrates the electric current over the period of the impulse. 

If the resistance of the coil is R and the number of turns n, the in- 
duced current strength may be written in the form 

fixe n dD 
~ R dé ° 





Fig. 117 


The quantity of electricity flowing in the wire during the period of 
the induced impulse is 


Q=§ 1dt=R-'n { ad—R-'n(@,—), 


where @, is the value of the flux passing through the coil in the 
first position and @, is the value in the second position. 

@, or M, is equal to zero, i.e., magnetic flux lines do not pass 
through the coil in the initial or final position, the performed meas- 
urement yields the value of the magnetic induction. For this purpose, 
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we need only divide the value of the magnetic flux by the coil area S, 
i.c., B=~S 


_ Of course, other methods of measurement are also possible. Thus, 
instead of rotating the coil, the field may be switched on or off. Also, 
to make the effect more pronounced, the coil can be turned through 
an angle of 180° instead of 90°. This doubles the effect. Similarly 
the polarity of the field can be reversed, rather than simply switched 
on or off. : 

Since the measuring coil may be made as small as a square milli- 
metre, measurements by this method enable us to accurately deter- 
mine the magnetic field in small regions. 

This method is also used to measure magnetomotive force. For 
this purpose, we use a measuring device known as a Rogovsky belt— 
a long coil on a flexible belt. The belt may be shaped into any desired 
form and its two ends placed at any two points in a given region. 
Also, if desired, the ends of the belt may be brought into contact 
with each other. We shall show that when the field is switched off 
the deflection of a ballistic galvanometer connected to such a meas- 
uring belt is proportional to the magnetomotive force along the path 
of the flexible belt. 

The deflection of the ballistic galvanometer is a measure of the 
quantity of magnetic flux passing through all the turns of the coil. 
Let n be the winding density, i.e., the number of turns per unit length 
of the measuring belt. Then, on a small belt segment AJ,, there are 
nAl, turns, and the magnetic flux passing through these nA/, turns 
is equal to @, nAl,. 

If the medium is homogeneous and all the turns have the same 
area, then 

®;,=pSH,, 
and the total magnetic flux passing through the entire measuring 
belt is 
t 


Taking the limit for A/;+0, we obtain 


© =pSn | Hdl. 


1 


Since the measurement occurs in a medium for which p does not 
significantly differ from p,, the quantity pSn is a constant of the in- 
strument. The throw of the ballistic galvanometer in these belt meas- 
urements is exactly proportional to the magnetomotive force between 
the points at which the ends of the belt are located. 
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By means of this device, it is easy to demonstrate the validity of 
the laws discussed in Sec. 104. Thus, as long as the coil encircles 
one and the same current, the magnetomotive force will remain the 
same for all configurations. Also, it is easily verified that the mag- 
netomotive force along a circuit not encircling current is equal to 
zero. For the case when the coil encircles a current several times, the 
magnetomotive force can be shown to increase by the corresponding 
number of times, etc. 

It should be emphasised that magnetic field measurements by 
means of induced impulses areof particular importance when we are 
concerned with the magnetic field inside a solid body. The only other 
method available is to make a slit in the solid body. Such a procedure 
is usually not possible. 

Let us consider the most common problem—the measurement of 
the magnetic permeability of an iron body. The most accurate results 
are obtained when the substance under investigation is in the form 
of a toroid. Two windings are wound on such a ring—one connected 
to a current source and the other to a ballistic galvanometer. If 
current is flowing, a magnetic flux ®=BS passes through the ring. 
By reversing the direction of the current through the primary wind- 
ing, an induced current is produced in the secondary. The quantity 
of electricity Q flowing through the galvanometer is related to the 
magnetic induction inside the ring by a relation already discussed, 
namely: 

— QR 

= 78! 
where S is the cross-section of the toroid (assuming the turns are 
wound right on the surface of the ring), n, is the number of secondary 
turns and R is the resistance of the secondary winding. As regards 
the magnetic field intensity, it may be determined by the formula 


for a ring solenoid, namely, =a The magnetic permeability of 
the substance under investigation is then equal to B divided by H. 


107. Finite Bodies in a Magnetic Field 


To one degree or another all bodies possess magnetic properties. 
These are indicated, first, by the fact that a magnetic field exerts 
forces and torques on bodies and, secondly, by the fact that a body 
placed in a magnetic field distorts the field. As was indicated above, 
the magnetic properties of a substance are characterised by the coef- 
ficient u—the magnetic permeability of the substance. In accordance 
with the value of p, bodies may be divided into three distinct classes: 
ferromagnetic substances—including iron, nickel and cobalt—whose 
relative permeabilities are much greater than unity; paramagnetic 
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substances, whose permeabilities are somewhat greater than unity; 
and diamagnetic substances, whose permeabilities are slightly less 
than unity. Typical values are given in the table. 


Subs tance Be x 
Coppr.......-- 0.999990 — 1075 
Water 22 2d bees AS 0.999991 —9x107° 
Platinum ....... 1.000300 300 x 1076 
SiliGomi, scene Se 0.999986 — 141078 
Tungsten. 2... 2 eee 1.000079 79x 1076 


When a diamagnetic or paramagnetic body is placed in a magnetic 
field, the distortion of the field is negligible. On the other hand, when 
a ferromagnetic body is placed in the field, there is considerable dis- 
tortion. 

The forces exerted by magnetic fields may be detected without 
particular difficulty in the case of paramagnetic and diamagnetic 
bodies. In the case of iron objects, everybody is familiar with the 
fact that magnetic fields exert large forces. 

Let us first consider magnetic forces. A body that does not possess 
magnetic properties becomes magnetic when placed in a field. This 
magnetisation process manifests itself in the acquisition of a mag- 
netic moment by the body.As we know, a system possessing a magnetic 
moment may be detected in two ways. In a uniform field, the body 
tends to become oriented in such a manner that the direction of the 
moment is parallel to the external field. Moreover, in a nonuniform 
field the body will experience a force tending to move it along the 
lines of force. 

In the case of ferromagnetic bodies, the torque may be detected 
without difficulty. The magnetic moment of the body may then be 
determined from the formula N=[ MA]. However, we are not usually 
interested in a body having a particular shape, but are interested 
rather in the substance as such. Therefore, when possible the meas- 
ured value is recalculated on the basis of unit volume. The vector 
directed along the magnetic moment and numerically equal to the 
magnitude of the magnetic moment per unit volume is called the 
magnetisation vector J. Of course, the magnetisation vector can be 
determined without difficulty from the magnetic moment of a body 
only if we are sure that the magnetisation of the sample is uniform. 
This is the case when the sample is in the form of an ellipsoid or a 
degenerate ellipsoid, i.e., a cylinder, plate or sphere (see p. 257). 
That is why such bodies are used in experiments of this kind. 


234 Magnetic Fields [Chap. XV 


Determination of the magnetisation vector by measuring the torque 
is easily accomplished for ferromagnetic bodies. Since the torque 
is very small for paramagnetic and diamagnetic bodies, this meas- 
urement is difficult to perform. It is therefore preferable in these cases 
io measure the forces acting on a body located in a nonuniform field. 

Let us consider a small volume of a magnetic substance located in 
a nonuniform field. For simplicity, assume that the field varies 


along one axis and that the gradient is equal to ae Since a small 


volume of magnetic substance behaves like a magnetic dipole, the 
potential energy of a unit volume may be written in the form U= 
=—JH. If the moment acts along the field, the force exerted on a 
unit volume of the magnetic substance is equal to the derivative of 
the potential energy with respect to the coordinate, i.e., 


dH 
f=—Jz. 


Thus, if we know the field gradient, we can determine the magnet- 
ic moment of a unit volume of the body under investigation by meas- 
uring the force. In practice, there are various ways of accomplishing 
this, the simplest being by means of a so-called magnetic balance. 
A thread is passed through an aperture made in one of the pans of an 
analytical microbalance. Then, the sample is attached to the end of 
the thread and suspended between the poles of a magnet. Before and 
after energising the magnet, the sample is balanced; hence, the 
difference between the readings is equal to the value of the force f. 

The weights must be quite accurate as can be seen from the follow- 
ing example. A piece of bismuth, a substance whose diamagnetic 
properties are most’ pronounced, has a magnetisation J of 2x 107 
Gaussian unit when placed in a magnetic field whose intensity H 
is ~1,000 oersteds. If the nonuniformity of the magnetic field is 


- ~ 50 oersteds/em, a force of only 1 cm will be exerted on each 


cubic centimetre of bismuth, i.e., f~1 dyne/cm'*. 

Experiments show that for diamagnetic and paramagnetic bodies 
the following simple relationship exists between the magnetisation 
vector and the magnetic field intensity: 


J= “Hf, 


where x is the magnetic susceptibility. For diamagnetic bodies x 
is negative, while for paramagnetic bodies it is positive. Values of x 
are given in the table on p. 283. When x is positive the magnetisation 
vector is in the direction of the field intensity vector, but when x is 
negative, i.e., for diamagnetic bodies, the magnetisation vector is 
opposite to the field. 
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’ Due to this difference in sign, the belfaviours of these two classes 
of bodies under identical conditions are completely different. This 
is illustrated in Fig. 118. As can be seen, the differences are quite 
striking. A paramagnetic body is attracted toward the region of 


rE] o—] 


a) Diamagnetic body 6) Paramagnetice body 
Fig. 118 





strong field, while a diamagnetic body is repelled from such a region. 
In a uniform field, a paramagnetic needle tends to become oriented 
with its axis along the flux lines, while a diamagnetic needhe tends 
to become oriented perpendicular to the flux lines (see the analogous 
example in the case of a dielectric, p. 255). 

The determination of magnetic susceptibility by measuring the 
force in a nonuniform field may be accomplished for solid bodies in 
the form of a monocrystal or powder. This method is also easily 
adapted to liquids. In this case, the experiment can be so arranged 
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that the measured quantity is the increase or decrease in the level 
of the liquid as it is attracted toward, or repelled from, the region 
between the poles of a magnet. 


108. Relationship Between Permeability 
and Susceptibility 


Both permeability and susceptibility may be determined straight- 
forwardly. The permeability is determined from the formula p= Fy 
by measuring the induction and field intensity. The susceptibility 
is determined, as described in the preceding article, from the forces 
exerted on a magnetic substance. To be sure, the relationship between 
these two characteristics of the magnetic properties of a substance 
can be established experimentally. However, there is no need to do 
this since an exact and simple relationship exists between p» and x. 
This will now be shown. 

Let us return to the experiment for determining the magnetic 
permeability of a body in the form of a toroid. The primary winding 


creates a field of intensity Hat, which is independent of the sub- 


stance of the toroid, i.e., if the toroid were not present, the field in- 
tensity would be given by the same formula. The situation is differ- 
ent as regards the magnetic flux. It can be shown experimentally 
that the value of B depends on the magnetic permeability. If an iron 
core is placed in the coil, B becomes hundreds or thousands of times 
greater than in the case of an air core. This increase in magnetic 
flux is due to the magnetisation effect. 

First, it should be noted that the magnetic induction of a ring sol- 
enoid without an iron core (u,H) has the significance of magnetic 
moment per unit volume. 

The magnetic moment of one turn is equal to /S, for in this discus- 
sion we shall employ the practical system of units. For the total mag- 
netic moment of the system we obtain n/S, and the magnetic moment 
in unit volume, us 


Ts is simply equal to the field intensity. The 


magnetic moment of the equivalent dipoles is p, times greater (cf. 
Sec. 103). Therefore, the magnetic induction wu, of a uniform magnet- 
ic field created by the turns of a ring solenoid without an iron core 
can be expressed as the magnetic moment of the equivalent dipoles 
per unit volume. 

We are entirely justified in assuming that the significance of mag- 
netic induction is maintained if, without disturbing the uniformity 
of the field, the coil is filled with an additional number of magnetic 
dipoles. If J is the magnetic moment per unit volume due to the ad- 
ditional dipoles, the magnetic induction increases by this amount 
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and becomes 
B=p,H+J. 


Such an increase in B also occurs when the solenoid is filled with a 
magnetic substance. Since J =y,*xH, then B=p,(x+1)H; hence, the 
susceptibility and permeability are related by the equation 


po=l+x. 


Analogous calculations employing the Gaussian system of units 
lead to formulas with other coefficients. The magnetic moment of 
currents (and dipoles) per unit volume is 

nis 1 
[5° an 


Therefore, in the presence of a medium, 
1 1 ‘ 
qeb=aGZits, ie, B=H+4aJ. 


Letting J/=x’H, we obtain 


B=(1+4nx’')H. 
Hence, 
, , wx 
p=1-+40x’, where x ae 


Example. Let us perform the calculation in the example on p. 284 employing 
the practical system of units. For bismuth, x’=2x 107 5, i.e., x=4tx’ = 80 x 107 
The piece of bismuth is in a magnetic field whose intensity is H=1,000 oersteds== 


x 3 amps = 106 amps" 
= =X 10° 1,000 metre 4x metre ° 








Furthermore, the nonuniformity is expressed by 








oersteds 3 amps __ 5X10° amps 
Fe = 50 ie =50xq me 10° 100 metre? 4m metre? ° 


g Volt sec 
metre? 


The bismuth magnetisation is given by J=p,xH=8nx107 
the force acting on a unit volume (1 metre’) is 





. Hence, 


ext 10° newtons 


metre? * 


fos Ht =8nx107§ x2" = 10 








109 newtons __ 1 dyne 


ae cae which agrees with the result obtained in the previous 


Clearly, 1 
example. 
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109. Distortion of a Magnetic Field Due to the Presence 
of a Magnetic Substance 


The problem of magnetic field distortion has practical significance 
only when the distortion is due to an iron body. To a large extent, 
weshall be repeating the analysis given on p. 256 for the analogous 
case of dielectric bodies. 

At the boundary of two media of different magnetic permeability, 
the magnetic field vectors (induction as well as intensity) are re- 





Fig. 119 


fracted. To determine the law of refraction, let us first consider the 
magnetomotive force along a small circuit ABCD whose sides paral- 
lel to the boundary surface are close to each other, but on either 
side of the boundary as shown in Fig. 119. Since no current 
flows through this circuit, the magnetomotive force is equal to zero. 
Let us resolve the magnetic field intensity vectors on either side 
of the boundary into normal and tangential components. From the 
figure it is evident that the magnetomotive force can be equal to 
zero only if the tangential components are equal to each other: 


Hy, =H,,. 


Another condition at the boundary between two media is estab- 
lished by considering the magnetic flux passing through a small cylin- 
der (not shown in the figure) enclosing a portion of the boundary. 
Since the magnetic lines have no sources, the number of flux lines 
entering the cylinder through the top is equal to the number leaving 
through the base. The lateral surface has an infinitely small area and 
the flux through it is equal to zero. Now, let us resolve the magnetic 
induction vectors on either side of the boundary into normal and tan- 
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gential components. Clearly, the flux entering the cylinder can be 
equal to the flux leaving the cylinder only if the normal components 
of the induction vectors do not change in crossing the boundary: 
B,, =Ban- 

SEronr Test two rules, we can determine the law of refraction for 
flux lines. It is evident from the figure that 


tang, by 
tand, pe 


In passing from air into iron, the magnetic flux lines are deflected 
from the normal to a considerable extent and as a result the flux 
density sharply increases. For this reason, an iron body, whose per- 
‘meability is hundreds or 
thousands of times great- 
er than p, “absorbs”, 
flux lines. This is the 
basis of magnetic shield- 
ing. Magnetic flux can- 
not, in effect, penetrate 
into a region bounded 
by iron since practically 
all of the magnetic lines 
enter the iron (Fig. 120). 
The distortion of a Fig. 120 
magnetic field due to a 
magnetic body of specified shape is determined in exactly the same 
manner as in the case of a dielectric. If the iron body has the form 
of an ellipsoid, cylinder or plate, theoretical calculations show 
that the field inside the body will be uniform if the field was uni- 
form before the iron was introduced. A relationship completely 
analogous to that given in Sec. 96 exists between H,, the external 
uniform field before the body is introduced, and H,, the field inside 
the iron after the body is introduced. The field intensity inside 
the iron body becomes less than the original intensity by an 
amount proportional to the magnetisation: 





jer: ean 
Bo 


In order for the demagnetisation factor to be dimensionless, it is 
necessary to divide the magnetisation by the magnetic permeability 
of vacuum. Continuing with the practical system of units and sub- 
stituting ; 


J=p.(u—!) Ap 
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we obtain the following relationship between the external and inter- 
nal fields: 


H, 
A= Tia 


In the Gaussian system of units 
H; = H, —NJ, 
bol 
J= re H,, 
and the relationship between the external and internal fields is 
given by 
Ay 


a ae 
1+ 


The demagnetisation coefficient _ the same value as in the case 


of dielectrics: 
N= (N'=5) 
for a sphere, N=40(N’=1) for a plate, etc. 


110. Magnetic Hysteresis 


In discussing the permeability of iron, we may have created the 
false impression that the magnetic properties of ferromagnetic sub- 
stances differ from those of paramagnetic substances only as regards 
the magnitude of the permeability. This is by no means the case. 
Ferromagnetic bodies differ from the others mainly in that the mag- 
netic state of such a body is not linearly dependent, and moreover 
is not uniquely dependent, on the magnetic field intensity. Therefore, 
the concept of permeability for ferromagnetic substances is very rel- 
ative. The magnetic properties of iron are best illustrated by a mag- 
netisation vs. field intensity curve or a magnetic induction vs. field 
intensity curve. These curves are very similar to each other. 

Let us consider the magnetisation of an iron body as a function of 
the field intensity. At first, the magnetisation increases slowly, then 
rapidly, and finally magnetic saturation sets in. Such magnetisation 
curves were first used by A. G. Stoletov and are characteristic of all 
ferromagnetic bodies (Fig. 121). We reiterate: the magnetisation 
and magnetic induction curves are very similar. The slope of the mag- 
netisation curve gives the magnetic susceptibility, while the slope 
of the induction curve gives the magnetic permeability. From the 
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figure, it is seen that the permeability (also susceptibility) curve has 
a maximum. For weak fields the permeability is low. As the field 
intensity is increased, p increases to a maximum, then begins to drop, 
and after reaching saturation 
remains unchanged. When the B, gauss a 








value of the permeability is 15000 |— a 
given without specifying the pt at 
external conditions, the maxi- A 
mum permeability is usually 10000 pf} TA t+ +} 4 
meant, ES 

But there is more to be said ny 2 ee 
about the behaviour of ferro- 5000+ $+ || 
magnetic substances. Let us LAY} ++ + + + 


assume that after the iron has 
been brought to a state of 
magnetic saturation the mag- 
netic field intensity is de- 
creased. It turns out that the 
induction decreases along a 
different curve, i.e., a curve 
lying higher than the initial 
magnetisation curve. The field 
intensity may be decreased to 


zero, but magnetisation re- 0 
mains. The corresponding val- 20 40 60 80 100 120 40 H, afm 
ues of magnetisation and Fig.” 121 


induction at this point are 

referred to as the residual values. To remove the residual magnet- 
isation, the polarity of the field must be changed. Thus, in the case 
of the experiment discussed on p. 282, it would be necessary to reverse 
the polarity of the current in the primary winding. Demagnetisation 
occurs when the field intensity attains a value H,, the value of the 
so-called coercive force. As the current is increased further in the 
same direction, the body begins to become magnetised in the reverse 
direction, i.e., what was previously a south pole becomes a north 
pole. The magnetic flux increases until the magnitude of the satura- 
tion is the same as in the initial process. 

Having attained a negative induction maximum, we may then 
proceed with the process in the other direction. In this manner, the 
hysteresis loop shown in Fig. 122 is obtained. 

It is seen from the figure that knowledge of the intensity of the 
field in which the iron is located is not sufficient to determine the 
magnetic induction and, hence, the magnetic permeability. Thus, 
for example, it is seen that three values of induction are possible 
for H = 400 oersteds—one occurs during initial magnetisation, the 
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second during demagnetisation and the third when the magnetisation 
process is repeated, i.e., just before the hysteresis loop is closed. The 
value of the magnetic induction, and the magnetic permeability, 
depends on the previous “history” of the sample. Hence, the designa- 
tion “hysteresis loop”. 

A hysteresis loop is usually drawn on the assumption that the fer- 
romagnetic body is brought to magnetic saturation. However, we can 
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clearly obtain numerous hysteresis loops having smaller dimensions 
by inscribing them, as it were, in the fundamental loop. For this 
purpose, it is necessary to begin the demagnetisation process before 
reaching saturation. Then, to each H value there corresponds an in- 
finitely large number of values. 

A procedure based on this fact is used to bring a ferromagnetic 
body to a state in which both induction and field intensity are equal 
to zero. This “zero point” is achieved by a series of successive magnet- 
ic reversals, whereby each succeeding cycle is begun at a lower 
intensity level than the previous one. 

The magnetic state of iron cannot be characterised only by the 
value of the permeability, field intensity or induction. Two of these 
quantities must be known—e.g., the induction and the intensity. 
The magnetic state of the iron is then represented by a point inside 
the fundamental hysteresis loop. 

The nature of a hysteresis loop depends to a great extent on the ma- 
terial. A body is said to be magnetically soft if the coercive force (and 
hence the loop area) is small. Typical soft materials are pure iron, 
silicon steel, and iron and nickel alloys (particularly permalloy— 
78% nickel). Carbon and other steels belong to the magnetically hard 
materials and are used in the manufacture of permanent magnets. 

Experiments show that the temperature of a ferromagnetic sub- 
stance rises when it is subjected to magnetic reversals. This is very 
important in electrical engineering, for when iron is placed in a var- 
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iable magnetic field the point on the B ~/(H) curve representing the 
magnetic state of the iron is continuously tracing a hysteresis loop. 
Every time a loop is traced a certain amount of heat is released, 
which according to magnetic field theory is related to the loop area; 
of course, the lower the value of the induction maximum, the smaller 
the loop area. Therefore, empirical formulas may be sought relating 
heat released and maximum induction. In electrical engineering, for 
example, the following formula is Bee used: 


Q= nB ; pe 


where y is a coefficient whose value is given in tables. 


Example. For a good transformer steel, n7=0.0011. When Bmax= 10,000 gausses, 
the losses are equal to Q= yB!. 6 =9.5X 10° ergs/em’=2.5X 1074 joule/em?. 


max 
This means that for magnetic reversals in iron due to an alternating current 
whose frequency v is 50 cps the power loss is equal to 12.5X 107 he per cubic 
centimetre of iron. 


CHAPTER XVI 
ELECTROMAGNETIC FIELDS. MAXWELL’S EQUATIONS 


111. Generalisation of the Law of Electromagnetic Induction 


In the preceding chapter it was shown that the motion of a conduc- 
tor in a magnetic field is accompanied by induction phenomena. 
If this moving conductor is part of a circuit through which the magnet- 
ic flux changes when the conductor moves, a current corresponding to 


the induced enf@=+ flows in the circuit. This current is due 


to the action of a Lorentz force: a force equal to + |B} acts on a unit 


electric charge. 

The induced current depends only on the relative displacement 
of the conductor with respect to the magnetic field. Thus, it may 
be asserted with equal vglidity that a Lorentz force is produced when 
a charge moves in a magnetic field or when the charge is “at rest” 
adie magnetic field moves. This follows from the principle of rela- 

ivity. 

Consider a system of coordinates relative to which a magnetic 
field moves. Such a coordinate system may be fixed, for example, 
relative to a laboratory bench along which the pole of a permanent 
magnet moves. Then, a Lorentz force will act on charges at rest rela- 
tive to this bench. Let us assume nothing is known about the moving 
permanent magnet. Having established that a force acts on the sta- 
tionary electric charges, we are perfectly justified in concluding that 
an electric field exists in this system whose intensity is equal to the 
Lorentz force divided by the magnitude of the charge. Thus the elec- 
tric field intensity in the “stationary” coordinate system, relative 
to which the source of constant magnetic field moves with velocity a, 
is expressed by the formula 


E=+ (0B). 


Of course, the relations differ for the electric field created by charges 
and the electric field created by the motion of the system relative 
to the magnetic field. To begin with, this new field that we are consid- 
ering has no charge sources. This means that the flux lines have neith- 
er beginning nor end. Moreover, it is not difficult to see that the 
flux lines of this electric field form closed curves, i.e., the electric 
field created by the moving magnetic field is rotational. 
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Imagine an arbitrary circuit that is stationary relative to the lab- 
oratory bench. The moving magnetic field crosses this circuit. 
If this imaginary circuit is replaced by a real wire circuit, then in 
accordance with Faraday’s law an emf is induced in the circuit that 


is equal to f Edt. Hence, the integral U=fEdtis not equal to zero. 
This means that the electric field E= + [oB] created by the moving 
magnetic field is a rotational field. 

For a real wire circuit u=1@ where @ is the magnetic flux 


passing through the circuit. However, it is immaterial whether or 
not wire is present at the location of the closed curve. The equation 


1 d ; . , Seiko tps es : 
U=-— =; is also valid for an imaginary circuit in the region where 


the sources of the magnetic field are moving. 

One final generalisation remains to be made. Experiments show 
that the cause for the change in the magnetic field is of no importance 
in the induction effect. The field change due to the motion of a perma- 
nent magnet and that due to achange of current strength in a station- 
ary coil can always be made equal by, for example, bringing the 
permanent magnet closer or increasing the current in the coil creat- 
ing the field. Therefore, the law under consideration must be valid 
in all cases, no matter how the magnetic field is changed. Thus, if 
the magnetic field (magnetic flux) changes in a certain region in 
space, a rotational electric field is produced that is related to the mag- 
netic field change as indicated by the following law. The electromo- 


tive force U= Ed along a closed curve is equal to the derivative 


with respect to time of the magnetic flux passing through this cir- 
cuit: 
1 d® 


ee aE 


or, in the practical system of units, 


do 
U=7 . 
This is the generalised law of induction, one of the most important laws 
of nature. 

Let us examine the mathematical expression for this law. Equating 
the expressions for the electromotive force and the magnetic flux, 
the law can be written as follows in the Gaussian system of units: 


§ Edt =— = %; | Beosads, 


296 Electromagnetic Fields. Maxwell's Equations [Chap. XVI 


and in the practical system of units 
§ Edt = — 5, \ Boos ads. 


First, as regards the minus sign that appears in this equation, it 
should be noted that in vector analysis the direction in which the 
circuit is traversed and the direction of 
the normal to the plane of the circuit 
are related to each other as follows: the 
positive direction of the normal in a 
right-handed screw system is such that 
as viewed from the vector terminus the 
circuit appears to be traversed in the 
counterclockwise direction (Fig. 123). 
Let us construct a closed curve in space 
and ascribe an arbitrary direction to it. 
The direction of the normal to the area 
encompassed by the curve under consider- 
ation is thus determjned. Magnetic flux 

Fig. 123 passes through this circuit. At a given 

instant, it may be positive or negative, 

depending on whether the induction vector forms an acute or obtuse 
angle with the normal. The derivative of the flux with respect to 


n 





) } 
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Fig. 124 


time is positive if the flux is increasing and is negative if the -flux 
is decreasing. Thus, taking the minus sign into account in the 
induction formula, the law may be stated as follows: The elec- 
tromotive force is positive if positive flux is decreasing or nega- 
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tive flux is increasing, i.e., the direction of the electric lines of 
force coincides with the adopted direction for positive. On the 
other hand, the electromotive force is negative if positive flux is 
increasing or negative flux is decreasing. These relationships are 
clearly illustrated in Fig. 124. 

We shall show that the minus sign in the induction formula is the 
mathematical expression of Lenz’s law. Assume, for example, that 
the north pole of a bar magnet approaches a coil and that the posi- 
tive direction in the circuit 
is as indicated in Fig. 125. \ 

Then, the magnetic flux is Pe 
Positive 
direction 






‘positive and so is its deriv- 
ative with respect to time. 
The electromotive force 
must be negative and the 
induced current is opposite 
to the direction adopted as Fig. 125 

positive. We can immediate- 

ly find the magnetic field of the induced current by recalling that 
the flux lines emerge from the side of the current ring from which the 
current appears to be moving in a counterclockwise direction. There- 
fore, as the magnet approaches the circuit, a current of such direc- 
tion is induced in the latter that the field produced tends to oppose 
the action which caused it. This is a statment of Lenz’ law. It is not 
difficult to verify this important rule for other particular cases as well. 

Let us sum up. A varying magnetic field is inseparable from an elec- 
tric field. Moreover, it is seen that the division of fields into electric 
and magnetic is a relative matter. From one viewpoint there is only 
a magnetic field in space. From another viewpoint in addition to the 
magnetic field there is an electric field. 

A rotational electric field consists of electric lines of force that 
link the magnetic induction vectors when the magnetic flux passing 
through a closed line of force changes with time. When the flux 
increases, the direction of the line of force is clockwise as viewed 
from the induction vector terminus. 


112. Displacement Current 


Electromagnetic field theory, whose foundation was laid by Fara- 
day, was mathematically perfected by the English scientist James 
Clerk Maxwell. One of Maxwell’s most important new ideas was that 
symmetry must exist in the interdependence between magnetic and 
‘electric fields. : 

In the preceding article, we discussed the problem of creating an 
‘electric field by varying magnetic flux. The question naturally 
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arises: Does a variable flux of electric lines of force create its own mag- 
netic field? Maxwell answered this question in the affirmative and 
advanced the hypothesis that a relationship exists between variable 
electric flux and a magnetic field that is quite analogous to the gener- 
alised law of induction. According to the hypothesis, if a change 
in electric flux occurs in some region of space, a rotational magnetic 
field is created. Moreover, the magnetomotive force U taken along 
a closed curve is equal to the change in electric flux passing through 
this closed curve, i.e., 


dN 
U= ad ’ 
where 
u= § Hat 


and the electric flux 
N =S DeosadS, 
S 


In the Gaussian system of units 


U 1 dN 


cd * 


The parallel in the relationships between magnetic and electric 
fields does not extend to the sign before the derivative of the flux. 
As is known, when currents are present the magnetomotive force 


along a closed curve is equal to U=/ (or 21 in the Gaussian system 


of units). How should the equation for magnetomotive force be writ- 
ten for such a closed curve that encloses electric current and variable 
flux of electric lines of force? Maxwell assumed that the magnetomo- 
tive forces are additive. Thus, the general formula has the form 


§ Hat =1+94 


or in the Gaussian system 
4m dN 
§ Hdt= 2 (1+) ; 


WY has the dimensionality of electric current 


dt 
strength. Maxwell called it displacement current. He thereby incor- 
porated in this designation the very widespread notion at the end of 
the nineteenth century that the field in a vacuum displaces the par- 
ticles of an “ether” from their positions of equilibrium. This designa- 
tion has continued to prevail in science, although now the presence 


The expression 
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of a field in vacuum is not related to the concept of particle dis- 
placement of any medium whatsoever. In a dielectric medium, the dis- , 


placement current “a may be resolved into two components, corre- 


sponding to the intensity and polarisation vectors into which the 
displacement vector D can be resolved (see p. 252). Therefore, the por- 
tion of the displacement current “flowing” in the dielectric is deter- 
mined by the change in the polarisation vector, i.e., by the relative 
displacements of the centres of gravity of 
the positive and negative charges. 

Before discussing the role of displacement 
current in one or another process, we shall 
prove an important proposition concerning 
the sum of the conduction and displace- 
ment currents. 

Consider an arbitrary system of electric 
currents and imagine a closed surface drawn 
in such a manner that the currents intersect 
it. If the currents are constant it follows Fig. 126 
directly from the law of conservation of 
electricity that the sum of the currents entering the closed sur- 
face must be equal to the sum of the emerging currents, or, to be 
more concise, the algebraic sum of the currents flowing through 
a closed surface is equal to zero. It is evident that this law may not 
be obeyed by variable currents: for example, in the case of a closed 
surface enveloping one plate of a condenser connected in an alternat- 
ing current circuit (Fig. 126) or a closed surface through which the 
top of an antenna protrudes at one point. 

However, this theorem is valid for variable currents as well if 
the term “current” is taken to mean “total current”, i.e., the conduc- 
tion current plus the displacement current, rather than the conduc- 
tion current alone. To prove this, it suffices to consider an arbitrary 
curve on which a surface is based and for which the following relation 
is valid: 





f Hdl = 14 Taispr 


By gradually reducing the closed curve to zero, the surface S on 
which this circuit is based becomes closed as shown in Fig. 127. 
(This process is similar to constricting the opening of a draw-string 
pouch.) The magnetomotive force reduces to zero and hence the sum 
of the conduction and displacement currents passing through the 
closed surface also equals zero. 

Now let us discuss the role of displacement currents in electromag- 
netic phenomena. 
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It can be shown that displacement currents are negligibly small 
where the conduction currents are different from zero. Hence, the 
displacement current within a conductor is always disregarded. 

In calculating the value of the displacement current in dielectrics, 
two cases should be considered — displacement currents in a dielectric 
surrounded by a conductor forming a closed circuit and displacement 
currents that are a continuation of a conductor of an open circuit. 





Fig. 127 


Consider a conductor forming a closed circuit in which electric 
current is flowing and which is intersected by a closed surface. If the 
current is constant, then at each instant the same amount of electric- 
ity passes outward through the surface as inward. The situation is 
different in the case of variable currents. Here, the strength of the ° 
variable current may have different values in different parts of the 
circuit (see below, p.319). As a result, the strengths of the currents pass- 
ing inward and outward through the surface at a given instant may 
not be equal. Then, a displacement current “flows” via the dielectric 
from the point where the current is less to the point where the current 
is greater, and in this manner the current deficit is compensated for. 
Clearly, changes in the displacement current with respect to time will 
exactly correspond to changes in the conduction current. This phenom- 
enon is significant only when the frequency of the current is suffi- 
ciently high. 

If the conduction current does not form a closed circuit, e.g., in 
the case of an alternating current circuit containing a condenser, the 
conduction and displacement currents are simply equal to each other. 
In this case, it may be said that the conduction current circuit is 
closed by the displacement current. 

In spite of the fact that the magnitudes of the displacement cur- 
rents are quite large for such cases, certain calculations may be’per- 
formed without considering them. Thus, when the conduction current 
circuit is closed by the displacement current between the condens- 
er plates, the magnetic field created in this region by the displace- 
ment current is the same as the field that would have been produced 
if the conduction current flowed in an uninterrupted circuit. There- 
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fore, the presence of displacement current does not affect the calcula- 
‘ tion of the magnetic field, the coefficient of self- induction of a sys- 
tem, etc. 


113. Nature of an Electromagnetic Field 


The following equations, which were discussed in the two preceding 
articles, are called Maxwell’s equations: 


$Edt=— and f Hat=14SX. 
These equations concisely sum up our knowledge of electromagnetic 
fields. 

Maxwell’s equations cannot be derived. The discussion of the pre- 
ceding two articles does not constitute a derivation, but constitutes 
rather an illustration of conjectures leading Maxwell to his discovery. 

A large class of phenomena of interest to physicists, electrical engi- 
neers and radio engineers obey Maxwell’s equations. The laws of these 
phenomena are a consequence of Maxwell’s equations and may be 
derived from them. The extraordinary importance of the predictions 
based on Maxwell’s equations gives these equations equal rank with 
Newton’s laws of motion and the principles of thermodynamics 
as fundamental laws of nature. 

We shall not go into the mathematical methods of solving Max- 
well’s equations. It turns out that the above integral equations may 
be transformed into differential equations. Then, by solving Maxwell’s 
differential equations, it is possible in principle to determine the 
electromagnetic field for a given distribution of charge and current. 

Let us again consider the physical essenceof electromagnetic phenom- 
ena as given by Maxwell’s equations. It may be summarised in the 
following manner. 

The division of an electromagnetic field into electric and magnetic 
fields has only relative meaning. If from the viewpoint of an inertial 
system of coordinates only a magnetic field exists, then from the view- 
point of another system moving relative to this system there exists an 
electric field in addition to a magnetic field. The converse is also true, 
namely, if an observer in one system of coordinates finds only an 
electric field present, then an observer in another inertial system will 
find that both an electric field and magnetic field exist. 

Let us now consider an electromagnetic field from the viewpoint 
of an inertial frame of reference, first directing our attention to a region 
of space in which free electric charges and hence conduction currents 
are absent. In this case, Maxwell’s equations have the form 


PHa=* and § Edi=—Z. 
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Both the magnetic and electric fields have a purely rotational charac- 
ter, ie., the lines of force are closed and mutually interwoven: 
electric lines encircle magnetic lines and magnetic lines encircle elec- 
tric lines. The electromagnetic field may be depicted as a chain of 
rings, whereby closed magnetic lines of force are alternately linked 
with closed electric lines of force (Fig. 128). Such a chain exists only 
if the field is variable: A ring of increasing magnetic flux creates about 


Poe: C= L 


itself a ring of electric flux; the varying electric field creates a ring 
of magnetic flux, etc. 

If the region of space under consideration contains charges and 
currents, then in addition to rotational fields with linked lines of 
force there exists a rotational magnetic field whose closed flux lines 
encircle currents and a potential electric field whose flux lines begin 
on positive charges and terminate on negative charges. 


CHAPTER XVII 


ENERGY TRANSFORMATIONS 
IN ELECTROMAGNETIC FIELDS 


(114. Transformations in Steady Current Circuits 


Let us consider a portion of a conductor through which a steady elec- 
tric current is flowing. If the resistance of the conductor segment is R 
and the potential difference across it is U, the current strength is 


determined by Ohm’s law, namely, / = ie . The electric field performs 


work in moving charges along the circuit and the work per unit charge 
is equal to U. Since the current strength is, by definition, the quan- 
tity of electric charge flowing through the conductor cross-section per 
unit time, the product /U yields the work performed by the field 
in moving the electric charge per unit time. This product, /U, repre- 
sents power. If the current is a steady one, this work is completely 
converted into heat (thermal energy). Thus, the formula for calculat- 
ing the thermal effect of current is 


U? . 2 


The transformation of the work performed by the electric field 
into heat occurs at each point of the conductor. To express this mathe- 
matically, Ohm’s law must be converted into a form applicable to 
a point of a conductor, rather than to a portion of a ee 


By introducing the current density j, which is equal ioe z= , whereS 


is the cross-section of the conductor, and by replacing thee expression 
for the potential difference by El, and, finally, by expressing the re- 
sistance in terms of the conductor length / and its cross-section, i.e., 


4 jit 
R=77, we obtain: j=AE. 


Thus, it may be said that thecurrent density is directly proportion- 
al to the electric field intensity. The specific conductivity 4 is the 
coefficient of proportionality and the direction of the current is as- 
sumed to coincide at each point with the ditection of the intensity. 
The formula 


JHE 


is called the differential form of Ohm’s law and should be viewed as an 
empirical law generalising the laws for current flow. in conductors. 
Ohm’s law in its usual (integral) form is a consequence of this equa- 
tion. 

Consider an infinitely small volume element of the conductor, 
dt, in the form of a cylinder whose generatrix dl is parallel to the flux 
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lines and whose base dS is perpendicular to the current. The amount of 
electric charge flowing through a cross-section of the cylinder is 
jdS and the potential difference between the ends of the element is 
Edl. Hence, the work performed by the field in moving the electric 
charge through this volume is equal to jEdt. This formula also gives 
the heat released inside the volume dt. If we are interested in the work 
of the current in a large volume of the conductor, the last expression 
must be integrated. The formula 


> ne 
jE=4-=2E 


gives us the expression for the work of the current or the thermal ener- 
gy released per unit volume of the conductor. 

Thus, in the case of a portion of a direct-current circuit, the energy 
transformations are reduced to the transformation into heat of the 
work done by a field. However, the picture changes as regards the 
energy balance of the entire closed direct-current circuit. The work 
performed by the electric forces along a closed curve when the field 
is constant is equal to zero, for the work performed by the electric 
forces in moving charge along the external portion of the circuit is 
equal and opposite to the work required to move the charge along 
the internal portion of the circuit. Therefore, the release of thermal 
energy in a direct-gurrent circuit occurs only at the expense of the 
energy supplied by the current source—accumulator, electric genera- 
tor, etc.—i.e., at the expense of energy of nonelectrical origin or, 
as it is sometimes said, at the expense of the energy of an “applied” 
force. The role of electric current is reduced simply to the “transfer” 
of energy from the current source to the point where the heat is re- 
leased. The energy that asourceis able tosupply is given by the elec- 
tromotive force @, which by definition is measured by the work 
performed in moving a unit charge along a closed curve. Actually, 
the applied emf performs this work only over those small portions 
of the circuit where the charge must be moved against the forces of an 
electric field. 

The power of the direct-current circuit is given by the expression 
I6. This may be expressed in terms of unit volume if it is assumed that 
the applied forces are distributed throughout the volume. Then, the 
work performed by the applied forces is given by 


jEvr', 


where E”?’ is the “intensity” of the applied forces. 

Designating the work of the applied forces by P, and the thermal 
energy released by Q, the essence of the electrical transformations 
in a direct-current circuit may be expressed by the concise formula 


P—Q=0. 


Sec. 115} Transformations in a Closed Circuit of Variable Current 305 


Example. For an feotated copper wire of cross-section S=4 mm?, the permis- 
sible current density in the case of an open wire is j=9 x 10° amps/metre?. A 
1-metre length of such a wire has a resistance of 4.25 107% ohm. For the indi- 
cated value of j, a current of /=36 a flows in the wire and the thermal energy 
losses per second for this portion of the circuit amount to /? R=1,296X 4.25X 
X 107*6 joules, i.e., in a unit volume 0.33 cal are released each second. 


115. Transformations in a Closed Circuit of Variable Current 


A flow of variable current is inevitably accompanied by induction 
effects. Thus, to a variable current strength therecorresponds a vari- 
able magnetic flux MD. Here, Drepresents the number of lines of force 
which are created by the current circuit and which pass through the 
conducting circuit. In this case, the induction effects are due to the 
current’s own magnetic flux, whence the designation self-induction. 


Since @ is continuously changing, an induced emf @'"4=— & : 
exists in the current circuit at each instant, in addition to the ap- 
plied emf. 

The magnetic flux is always proportional to the current to the 
first power. Hence, the formula ®=L/ has universal validity. The 
coefficient L is called the inductance of the circuit or the coefficient 
of self-induction. The value of L depends on the geometric properties 
of the circuit and the nature and distribution of magnetic bodies in 
the system. It does not depend on the conditions under which the 
system of conductors and magnetic bodies operate. Thus, for self- 
induced emf the following equation holds: 


ind dd 
ert a 


The significance of the minus sign in this formula may be explained 
as follows: When the current increases, the induced emf opposes the 
applied force, i.e., the induction is in the direction opposite to the 
applied force. On the other hand, when the current decreases, the 
directions of the induced emf and the applied emf coincide. This 
is the reason for the analogy generally made between mechanical 
inertia and self-induction. Self-induction impedes an increase as well 
as a decrease in current. 

Ohm’s law, relating emf and current strength, remains valid. 
Therefore, the product of current strength and total circuit resistance 
will’ at éach instant be given by the following relation: 


IR= gurl + gird gurl mf a , 
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Multiplying both members of the equation by the instantaneous cur- 
rent strength, we obtain the energy equation: 


= appl al 
PR=1G? —LIi-=. 


Here, 6%?! — P is the work of the applied forces and /*R =Q is the 
thermal energy. It is seen that in a variable current circuit these two 
quantities are not equal to each other. The difference P—Q is equal 


at each instant to Lid, i.e., it is equal to the derivative of 
1 


5 LI’. In other words, the excess of the work of the applied forces 
over the thermal energy released goes to increase the magnitude of 
SL On the other hand, the excess of the heat released over the 
work of the applied forces occurs at the expense of the magnitude 
of + LP. The equation 


d 1 
P—Q=4 (+ LP) 
is the expression for the law of conservation of energy. 


Clearly, the quantity W = Ee 5 represents energy. It is the mag- 
y D} 


netic energy of a system that is inseparably linked with the existence 
of a magnetic field in it. (In the Gaussian system of units the expres- 


sion for magnetic energy is > LI’. \ There is magnetic energy in a 


direct-current circuit too, but in this case it does not manifest itself 
since it remains unchanged. The induction effects occur only when the 
current is switched on and off. When the circuit is closed the applied 
forces perform work, which is expended not only in the release of 
heat, but also in the storage of magnetic energy. On the other hand, 
when the circuit is opened the thermal energy released is at the 
expense of the magnetic energy of the current. 

The magnetic energy formula may be verified experimentally by 
closing or, even better, opening a current circuit. The thermal ener- 
gy released after the source is disconnected is numerically equal to 
the magnetic energy of the current. If the coefficient of self-induction 
is large, the release of heat continues over a period of time sufficiently 
long to enable us to measure the heat by, for example, calorimetric 
means. 

Inductance may be measured in various ways and in the simplest 


cases may be calculated from the formula L = =. The problem is re- 


duced to the calculation of the magnetic flux passing through the 
system. 
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An expression for the inductance of a ring solenoid will be required 
below. The magnetic flux through one turn of a coil is D=p,pHS, 
where S is the area of the turn, and the flux through n turns is D= 
=n wHS. Substituting the expression for the field intensity (using 
the practical system of units), we obtain 


I 
O= np pws <n ‘ 


Now, dividing both members of this equation by the current strength, 
we obtain an expression for the inductance of g coil (also approxi- 
mately valid for a straight solenoid): 


2 
L=pyp-> ee 


The inductance of a coil is directly proportional to the magnetic 
permeability of the medium and increases sharply with the number of 
turns. Anincrease in inductance is achieved by using iron or by increas- 
ing the number of turns. In order to make clear the relationship 
existing between the coefficient of self-induction and the dimensions 
of the coil, let us multiply the numerator and the denominator by J. 


Then, 
L=pu,p (+) V 


and it is evident that the inductance is directly proportional to the 
volume occupied by the magnetic field and to the turn “density” 
squared. 

‘ Example. Consider a long solenoid of small cross-section (-=15 cm, n=1,500 


turns, S=1 cm? and /=0.1 a). At the centre of the solenoid, the magnetic flux 
is O=np.pHS=6nX 10~* volt sec. The inductance of this solenoid is 


{1 S00F 1074 = 1,9X 107? henry. 


L= po SS =4nx1077X1 x GE 


In practical units the inductance is measured in henrys (1 henry=1 ohm sec), 
The inductance of coils in radio engineering is measured in millionths and thou- 
sandths of a henry. Chokes having iron cores can attain inductance values of 
the order of a number of henrys. 


116. Magnetic Energy of a Field 


In the chapter devoted to electric fields, it was shown that the elec. 
tric energy of a system may be viewed as a quantity whose density 


distribution is represented by Sek? (in the practical system of units). 


The electric energy of the system is then determined by integrating 
this expression over the region occupied by the field. The importance 
‘of this circumstance was emphasised as it enables us to express the 


11" 
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energy in terms of the field intensity and it confirms our conception of 
a field as something that can be localised. 

Naturally, we expect the situation to be similar for magnetic 
fields and this is indeed the case. It can be mathematically shown 


that the transition from the magnetic energy formula, 5 LI’, to 


the expression for the magnetic energy density, + we, is com- 


pletely analogous to the corresponding transition from electric fields. 

Let us consider this transition for the simple case of the uniform 
field of a ring solenoid. Substituting the expression for the inductance 
in the magnetic energy formula, we obtain 


n\2 
ee Sos ker 


m 


But a is the field intensity. Hence, the magnetic energy of a coil 

may be written in the form 

Wb y 
m 9 , 


so that the magnetic energy density is given by the expression 
wo, = bat 

In the Gaussian system of units 
Wm = pe WH. 

Thus, for any system of currents, the magnetic energy may be rep- 
resented by the integral over the volume occupied by the field: 
nae | aH? dt (in the practical system of units) 

and 


W = gy | wide (in the Gaussian system of units). 


Now, consider the magnetic energy of two currents. The expression 
for this energy naturally divides into three integrals if the intensity 
of the resultant field H is viewed as the sum of the field intensities 
of the two currents: H=H,+H,. In the following expression for the 
ee ag energy, the significance of each of the integrals is quite 
evident: 


Wai lant dv +p, | pH H, de+< | wii de. 
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The first and third integrals yield the magnetic energy of the first 
and second currents, respectively, while the second integral repre- 
sents the interaction energy of the two currents. This last integral 
may assume different values even though the magnitudes of the field 
intensities, H, and H,, do not change. Thus, if the mutual disposi- 
tion of the two currents changes, the field vectors H, and H, are 
turned, generally speaking, relative to each other and the value of 
the interaction energy also changes. 

- Of course, the first and third integrals may be expressed in terms 


of the current strength and the inductance: See and eine 


As for the second integral, it is clear that its value is proportional to 
the product of the current strengths. Thus, 


| nH, dt=MI,I,. 


The coefficient of proportionality M is known as the coefficient of 
mutual induction. Just as in the case of inductance, M depends on the 
geometry of the system and the distribution of magnetic bodies. 

Thus, it is evident that the change in the magnetic energy of a 
system of currents is related not only to the work of the applied forces 
and the thermal energy released, but also to ‘the work performed 
by the field in moving the conductors under the action of an Ampére 
force. Hence, the law of conservation of energy requires that the fol- 
lowing equation be satisfied: 


dW ,=— A—(Q—P) dt, 


where A is the meghanical work. Thus, it may be stated that, in the 
general case, the magnetic energy expended is equal to the work of 
thoving the conductors and to the excess of released thermal energy 
over the work of the applied forces. 

The relations introduced in this article do not take into account 
one phenomenon—magnetic hysteresis. This problem will not be 
dealt with because of its specialised nature. 

Example. The energy stored in the magnetic field of the coil described in 
the example on p. 307 is ~ 

W py = oe we LORIE DT = 0.95 X 1075 joule. 
The energy eel is 


Wm = Pot = 421X107 re Relea eee 


= 0.63 joule/metre’, 
Naturally, the same result could be obtained iy dividing the total energy of 
the magnetic field by the volume of the coil: 


Wm 


On-G ° 
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117. Electric Oscillations 


The processes of transforming electric energy into magnetic energy 
and vice versa are of fundamental importance in electrodynamics. 
Asimple system in which such transformations occur is a charged elec- 
tric condenser whose plates are connected at a certain instant to the 
ends of a coil (Fig. 129). When the con- 
denser discharges, an electric current 
flows through the coil and creates a mag- 
netic field around it. At each instant, 
the electric field of the condenser and 
the magnetic field of the coil are closely 
C linked. The energy of this system at each 
instant is equal to the energy of the 
electric field, which is concentrated 
mainly between the condenser plates, and 
the energy of the magnetic field, which 
is concentrated mainly inside the coil. As is well known, electric 
oscillations arise in such a circuit and we shall now show that 
such oscillations are inevitable. 

To begin with, let us disregard thermal energy losses. Then, the 
law 4 Copestyation of energy requires that the following equation be 
satisfied: 


Fig. 129 


-i@,1 tes 
W=+-—G+4 LP =const. 
The sum of the electric energy and the magnetic energy is the same 
at each instant. Hence, the derivative of the above expression with 
respect to time is equal to zero: 


dW Q dQ Idl__ 
aE ed heap 


Since the current strength is equal to the decrease in charge on the 
condenser plates, i.e., 
2 
dt ’ 


the equation may be simplified as follows: 


Q dil _ 
+L Z=0. 


Such a relationship between the charge on the plates of a condenser 
and the current strength, which is equal to the derivative of the charge 
with respect to time, can be satisfied only if harmonic oscillation of 
the charge and the current is assumed. 


Sec. 117] Electric Oscillations 3il 


This becomes evident upon comparing the above relations with 
the equations for mechanical vibrations (see p. 87): - 


dQ dx , 
oad Ea 

dl 1 do 
ae ee, gee 


Charge and current, on the one hand, are analogous to displacement 
from equilibrium and velocity of motion, on the other. As for the 
parameters of the system—inductance is analogous to mass and recip- 
rocal capacitance is analogous to the rigidity of the system. 

Let the initial time equal the instant when the condenser is fully 
charged, and assume that 

Q=Q, cos wt. 

Then, 


[=—Q, osinot. 
Substituting in the differential equation, we obtain 
—LQ,w’ cos ot = — “ Q, cos wt 


or, after cancelling, 
1 


o= Vit: 


Thus, irrespective of the initial charge on the condenser plates, the 
harmonic oscillations occurring in the condenser have a natural fre- 


1 
quency = 7774. The smaller the capacitance and inductance of 


the circuit, the higher the frequency of the electric oscillations. 

What is the situation in a real circuit where the thermal energy 
losses cannot be neglected? Clearly, the total energy of the system in. 
this case will decrease in accordance with the equation 


: : 1, 4Q dl 
dW=—I’Rdt, ie., —PR=—Q-Gtllg: 


Differentiating again with respect to time and using the relationship 
between charge and current, we obtain -an equation of the form 


a] al 1 
Loge tRat oe i=. 


At this point, an analogy should be drawn between the corresponding 
electric and mechanical quantities. Comparing the last equation 
with the equation for mechanical vibrations with friction (p. 91), 
it is seen that the electric resistance is analogous to the coefficient a, 
which is a measure of the mechanical resistance. 
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The solutions of such linear differential equations are considered 
in courses in advanced mathematics. We shall simply give the final 
result, which incidentally is easily verified by substitution in the 
above equation: 

I= I,e~*' cos ot. 


The frequency of oscillation is given by 
o= V o—B* 


Thus, the process is determined by two characteristics—the natural 





frequency of free-undamped oscillations, o, = Vic’ and the damp- 
ing coefficient, p= . It is seen, first, that light damping is 
achieved by decreasing the resistance relative to the inductance. (To be 
sure, this is not easily done, for if we increase the number of turns on 
the coil, both quantities increase simultaneously. However, L does 
increase at a faster rate.) Secondly, it will be noted that when 


a<p*, ie, 4L<CR?, 


oscillations become impossible. The discharge of the condenser under 
such conditions leads to an aperiodic process analogous to the return 
swing of a pendulum displaced in a viscous medium from its equilib- 
rium position. 


| Example. Assume that we have a variable condenser whose maximum ca- 
acitance C=500 pf. Calculate the corresponding inductances of the radio coils 
ier ‘the 1,500 metre and 15 metre wavelengths. 
1. The frequency of electric oscillations corresponding to 41=1,500 metres 
eo cps=200 ke/s. ‘ 
ince 


o=2nv,= 





, then L,= = 1.2«107* henry = 1.2 millihenry. 








VL,C 


In order for the process in the circuit to be periodic, the resistance of the cir- 
cuit must be less than R= 2 fun = 3,000 ohms. 


2. Ae=15 metres, v,=2X 107 cps=20 me/s, Le=0.12« 107* henry=0.12 
microhenry. In order Hor “oscillations to be possible, the resistance of the circuit 


must be less than R=2Y & = 30 ohms. 


118. Electromagnetic Energy 


In a system in which the oscillatory circuit consists of a condenser 
and a coil (particularly if the condenser is composed of large plates 
separated by a short distance and the coil has a large number of turns), 
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the electric and magnetic fields are concentrated in their respective 
regions. Therefore, it is possible to consider the electric and magnetic 
energies as two related, but nevertheless distinct quantities. This 
division loses physical significance to a large extent when we consider 
rapidly varying fields, where large electric and magnetic fields exist 
in the same regions. 

Recalling what was said in Sec. 113 about the relative nature of the 
division of an electromagnetic field into electric and magnetic com- 
ponents, it should be understandable that it is necessary to introduce 
into the theory the concept of an électromagnetic energy that is for- 
mally equal to the’sum of the electric and magnetic energy of the 
field. The density of electromagnetic energy in space is 


4 ; 
w= —— (eE* +H’), 
while the electromagnetic energy contained in the volume V is 


w= a ( (cE? + pH?) dV. 
" 


In rapidly varying fields, the physical significance of the transfor- 
mation of magnetic energy into électric energy, and vice versa, is 
lost. At the same time, any energy transformations occurring in an 
electromagnetic field must be taken into account in the energy bal- 
ance by a single electromagnetic energy quantity. : 

If the above expression for electromagnetic energy is assumed to be 
valid, then, using the electromagnetic field equations of the preced- 
ing chapter, the following theorem for the decrease in electromagnetic 
enerey within a certain volume of space can be rigorously proved: 


— i =(P—Q + K cosads. 


This theorem was proved in 1884 by Poynting. (It was proved 
in a more general form, i.e., not in connection with an electromagnetic 
field, by N. A. Umov in- 1874.) The integral on the right is the flux 
of the vector K.* Using calculations that we have been forced to omit 
due to their complexity, it can beshown that this vector is perpendic- 
ular to the plane passing through the field vectors E and H (Fig. 130), 


and is equal to K==— [EH] in the Gaussian system of units agd 
to K=[EH] in the practical system of units. 


_* It should be recalled that in mathematics an expression in the form 
§ Ads is called the flux of vector A through the surface S, 
S' : . 
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Since the values of the field intensities decrease quite rapidly with 
increasing distance from the field sources, the flux of the Poynting 
vector is reduced to zero when all of space is taken into account. 
In this case, the theorem states: The change in electromagnetic 

energy is equal to the excess of the 

£ work of the applied forces over the 
released heat. 

However, of most interest is the 

application of the theorem to a 

finite volume, i.e., when the flux 

of the Poynting vector is not equal 

to zero. If the volume under consid- 

eration does not include currents, 

k the equation assumes the form 


—F = K cosads. 


The change in electromagnetic en- 
ra ergy is equal to the flux of the Poynt- 
Fig. 130 ing vector through the surface bound- 
ing the volume under consideration. 

The Poynting vector characterises the flux of the electromagnetic 
energy and the last equation expresses the following fundamental 
concept: A change in electromagnetic energy within some volume is 
accompanied by an outflow or inflow of an equivalent amount of 
energy. 

Ty execs Poynting’s theorem is a direct consequence of the law 
of conservation of energy and the postulate stating that electromag- 
netic energy can be localised in space. 

If Poynting’s vector really has the significance of energy flux it 
should be related to the energy density as follows: K=vw (cf. p. 116 
where an analogous problem is considered relative to the propagation 
of elastic waves in a medium). By means of Maxwell’s theory, we 
can determine v, the propagation velocity of the electromagnetic 
energy. It turns out that 

c 

v= Veu . 

Thus, in vacuum electromagnetic energy should be propagated at a 
velocity c=3 x10"° cm/sec, which agrees excellently with experi- 
ment. The coincidence between the values of c determined from purely 
electrodynamic experiments (e.g., the measurement of the interaction 
between two currents) and the value of this constant determined by 
direct measurement of the propagation velocity of electromagnetic 
waves is remarkable and may be taken as practically conclusive proof 
of the validity of Maxwell’s theory. 
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In a medium, the value of the propagation velocity of an electro- 
magnetic wave is c divided by Vep.We shall see below under what 
conditions this relationship is satisfied and it shall be explained why 
certain deviations occur. 

Let us now return to the consideration of 
energy transformations in finite regions of space 
that contain conduction currents. 

Assume that in the region under investigation 
there exists a cylindrical conductor of radius r 
through which a current of density j flows. 
The intensity of the magnetic field at the surface 
of the conductor (cf. p. 276) is equal in the 


Gaussian system of units to H= =e rj and the 


magnetic flux lines form circles about the cur- 
rent axis. It can be seen (see Fig. 131) that 
Poynting’s vector is directed into the conduc- : 
tor, for the field intensity and the current Fig. 131 
vector have the same direction. As for the nu- 

merical value of Poynting’s vector, we obtain (at the surface of 
the conductor): 


£ 





c c fj ne GE 
Now, let us determine the flux of Poynting’s vector in a conductor 
segment of length J. This flux is equal to 


K x 2arb=t aeps L V, 


where V is the volume of the conductor segment. But f is simply the 


thermal energy released per unit volume of conductor. We have thus 
shown that the flux of the Poynting vector enters the conductor and 
transfers to it an amount of energy that is exactly equal to the energy 
expended in heat. 

Where does this flux come from? In exactly the same manner as 
above, it can be shown that the flux of energy comes from those por- 
tions of the conductor where applied forces, are present. 

This picture explains how electromagnetic energy is propagated 
along conductors. When electric power is transmitted from Kuiby- 
shev to a consumer in Moscow, the energy is delivered by electromag- 
netic waves and not by the first electrons initiating the motion along 
the conductor. 

Examples. 1. Let us determine the order of magnitude of the electromotive 


force produced in the antenna of a radio receiver located at a distance R=100 km 
from a transmitter whose power is P=100 kw=10* joules/sec. 
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The numerical value of Poynting’s vector at the location of the receiving 
antenna is 
P 105 


K=7yF = Frio = 8X 10-7 joule/metres? sec (watt/metre’). 


In the Gaussian system of units, the E and H vectors have the same dimen- 
sions (gm'/2 cm™~!/: sec ?). It can be shown that for an electromagnetic wave propa- 
gating in vacuum the numerical values of the E and H vectors in the Gaussian 
system of units are equal: E’=H’. For these quantities, the following relation- 
ships exist between practical and Gaussian units: 


1 voltimetre= 4-107 statvolt/em; 1 amp/metre=4x107° oersted. 


Then the numerical values of the E and H vectors in the practical system of 
units are: 


E=+X107E', H=4nxX1074H". 


Therefore, for an electromagnetic wave (E’=H’), we obtain: E=1200H. In 
the practical system K=EH; hence, K= a and E=Y120nK =1.7x 
x 107? volt/metre. 

This means that the potential difference produced in a receiving antenna 
having a length of I metre is of the order of 20 inv. 

2. Compare the value obtained above for K with the value of the solar con- 
stant, i.e., the energy that would arrive from the Sun each second on | cm? 
of the. Earth’s surface if there were no losses in the atmosphere: 


K sun=0,15 watt cm?=1,500 watts/metre?, 


119. Momentum and Pressure of an Electromagnetic Field 


According to the theory of relativity (see p. 420), matter which pos- 
sesses energy also possesses mass. The relationship between mass and 
energy is given by the equation E =mc’, where ¢ is the propagation 
velocity of light. As we already know, the energy of an electromagnet- 
ic field may be considered to have the following density distribu- 
tion in space: 


, 
W = —— (eE* + pH’), 


Thus, a unit volume of electromagnetic field possesses a mass of 
Ww 


ce 

Since moving matter possesses mass, it must also have a momentum 
equal to the product of the mass and the velocity of motion. We con- 
clude, therefore, that a unit volume of electromagnetic field has a 
momentum 


w 
ee eee 


This expression is appropriately’ called momentum density. 
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As stated earlier (p. 314), since Poynting’s vector has the signifi- 
cance of energy flow, tt must be related to the energy density in accord- 
ance with the formula K=we. Comparing the last two formulas, 
we see that the relationship between the momentum density and 
Poynting’s vector is given by the expression g=K/c’, where c is the 
velocity. 

Since a flow of electromagnetic radiation possesses mass and mo- 
mentum, it will exert pressure on a surface placed in its path. The 
magnitude of this pressure may be expressed in terms of the momen- 
tum density and may vary depending on whether the surface absorbs 
or reflects the wave energy. Of course, intermediate cases are also 
possible. . 

In the time At, the electromagnetic field included in a volume 
ScAt strikes the surface S. If total absorption occurs, a momentum 
equal to gScAt is lost in this time. But momentum divided by time 
is force, and force divided by area is pressure. Hence, the pressure 
exerted on the surface absorbing electromagnetic energy is equal to 
p-=ge, the product of the momentum density and the velocity of light, 


or, since g=—, the pressure is equal to the energy density w. 


Now, let.us consider an ideal elastic encounter between the field 
and the surface. If all the energy of the electromagnetic field (wave) 
is reflected, the change in momentum will be twice the incident mo- 
mentum, for the latter has reversed its direction. Just as in the pure- 
ly mechanical cases (p. 66), the force of an elastic impact is twice 
‘as large as the force of an inelastic impact. Hence, the pressure exert- 
ed by the wave on an ideally reflecting plate is 


p=2gc or p=2w. 


The formula for the general case is now easily obtained. If the 
plate reflects part of the energy and the coefficient of reflection is 
equal to 9, the pressure of the electromagnetic flux (wave) is given by 
the expression 

p=w(l—e)+2qw=(1+e)w. 


Using light, P. N. Lebedev verified these formulas experimentally 
in 1900 and thus greatly contributed toward the development of qur 
present conception of the nature of electromagnetic waves. The pres- 
sure of light is exceedingly small even for the most intense sources. 
For example, the pressure of light on a mirror located at a distance 
of 1 metre from a “lamp” of 1 million candle power is of the order 
of 10-* dyne/cm?. That is why Lebedev’s measurement of the pres- 
sure of light with an accuracy of 1-2% is viewed as a great experi- 
mental achievement. 
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Basically, Lebedev’s apparatus consisted of a pair of vanes at- 
tached to a light-weight suspension. One vane was an excellent absorb- 
er of light and the other an excellent reflector. The light was direct- 
ed first at one vane and then at the other, and, then, by comparing 
the displacement angles, it was possible to determine the magnitude 
of the force. The chief difficulty was how to take into account the ef- 
fect on the vanes of residual gas heating in the vessel containing the 
suspension. . 

As we have just seen, the theory of variable electromagnetic fields 
has led to the conception of a field as a physical reality (electromagnet- 
ic radiation). The great merit of Lebedev’s experiments is that they 
provided direct proof of the validity of this conception. 

An electromagnetic field possesses energy and momentum, is 
propagated in space with a specific velocity and exerts pressure on an 
obstacle. We shall see below (p. 558) that an electromagnetic field 
may be transformed into matter. All these facts taken together irre- 
futably prove that an electromagnetic field is a physical reality. 


CHAPTER XVIII 


ELECTROMAGNETIC RADIATION 


120. Elementary Dipole 


Electromagnetic radiation occurs whenever a variable electromag- 
netic field is created in space. An electromagnetic field, in turn, va- 
ries in time whenever the distribution of electric charge in a system 
changes or the density of an electric current varies. 

Thus, every variable current and pulsating electric charge is a 
source of electromagnetic radiation. 

Magnetic and electric dipoles having variable moments—particu- 
larly the latter—are the simplest systems producing electromagnetic 
fields. A system consisting of a stationary positive charge about which 
a negative charge oscillates constitutes such an electric dipole. 
If the oscillation is sinusoidal, the dipole moment will also be sinu- 
soidal, i.e., it is represented by the formula p =p, cos of. This simple 
radiator model has very great significance since many real systems 
can be represented to a high degree of accuracy by ideal dipoles. 

It will be recalled (see Sec. 93) that the electric properties of a sys- 
tem whose “centres of gravity” of positive and negative charge do not 
coincide may be described in terms of the dipole moment of the 
system. But most radiators of electromagnetic energy are electrically 
neutral systems whose positive and negative charges are capable of 
being displaced relative to each other. This is primarily because 
atomic and molecular systems fall under this heading. An electron 
rotating about the nucleus of an atom is a system having a variable 
dipole moment, and a neutral molecule whose atoms are in a state of 
oscillation is also, frequently, a system having a variable dipole mo- 
ment. However, our interest in the electric dipole extends further. 
In the following article, we shall see that a linear radio antenna may 
be likened to a dipole. (Incidentally, the analogous terms “oscillator” 
eh “vibrator” are somewhat broader in meaning than the exact term 
“dipole”.) 

Magnetic dipoles occur when the electric charge distribution and 
hence the dipole moment of the system remain unchanged while the 
current density and hence the magnetic moment of the system change 
with time. A typical example is a loop in which an alternating elec- 
tric current flows. If the current flows in a closed circuit, the electric 
charge is neither accumulated nor dissipated anywhere. The electric 
dipole moment of such a loop equals zero and does not change. How- 
ever, the loop’s magnetic field, which is related to the value of its 
magnetic moment, varies and, therefore, electromagnetic energy is 
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radiated. It follows from the theory that if a system possesses simul- 
taneously an electric and a magnetic moment the radiation from the 
magnetic dipole at large distances from the source is usually much less 
than the radiation from the electric dipole. 

If a dipole radiates by giving up internal energy or, as in the case 
of an antenna, by transforming the energy of an external source into 
radiation energy, the dipole is called a primary radiator. However, a 
secondary radiator is also of considerable interest. In this case, a 
-dipole is made to oscillate by the action of an electromagnetic wave 
and becomes a radiator only as a consequence of this action. Secon- 
dary oscillations are particularly intense when the primary wave 
is of the same frequency as the natural frequency of the dipole (reso- 
nance). 

Setting a dipole into an oscillatory state may be viewed as a 
mechanical process—the jostling of the charges by an external force 
equal to the product of the charge and the field intensity. At the same 
time, the process of creating secondary oscillations in a receiving 
antenna may be viewed as an induction process in which an alternating 
electric current is produced by an alternating magnetic field. To the 
extent that the antenna may be replaced by a dipole, both views are 
equivalent. 


121. Antennas as Electric Dipoles 


An important difference exists between the state of oscillation of 
an oscillatory circuit (p. 310) and the oscillation of the current in an 
antenna. In discussing the electric oscillation of a circuit, we referred 
to a definite instantaneous current strength and a definite instantane- 
ous charge on the condenser plates. It was assumed that the current 
strength in all parts of the circuit was the same, that the electric charge 
was concentrated on the condenser plates and, hence, at a given 
instant could only have a single value. 

The electric oscillation in the case of an antenna cannot be viewed 
in the same manner as the oscillation of a pendulum. However, the 
oscillation of an electric current in an antenna does have a mechanical 
analogue. This oscillation is very similar to the vibration of a rod or 
string, i.e., it can be represented by a standing wave. 

This may be strikingly demonstrated by showing that an excited 
antenna has current nodes and antinodes. A small bulb may serve as 
the current indicator (Fig. 132). It turns out that a conduction current 
antinode exists at the centre of a free section of line in which electro- 
magnetic waves are excited and that conduction current anodes exist at 
the ends. In such a line, the current at all points is unidirectional at 
each instant. At some instant the current at each point decreases to ze- 
ro and then begins to flow in the opposite direction. The electric charge, 
which is distributed continuously along the line, varies accordingly. 
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Clearly, as long as current flows in one direction, positive charge is 
accumulated on one half of the line and negative charge is formed on 
the other. When the current decreases,to zero, the charges at the ends 
are a maximum and of opposite sign. The current then begins to 
flow in the opposite direction and the charges decrease, becoming 
zero when the current strength in all parts of the line is a maximum. 
At this instant, the recharging process commences, charges of 
opposite sign are accumulated on the two halves of the line, ete. 





Fig. 132 


The reader will note that at each instant charges of opposite sign 
are located on the two halves of the line. Two charges that are equal 
in magnitude but opposite in sign, and separated by a certain dis- 
tance, constitute an electric dipole. It can be stated, therefore, that 
the electric oscillations of an antenna are very similar to the oscilla- 
tions of an electric dipole in which the dipole moment decreases from a 
maximum positive value to zero, then increases in the opposite direc- 
tion, then again decreases, etc. 

The field of an antenna differs from that of a dipole only in the 
region close to the antenna. At distances hundreds of times greater 
than the dimensions of the antenna, the field created by the antenna 
does not differ from the field created by an ideal electric dipole. 

Let us again return to the analogy between an antenna and a rod. 
The natural frequencies of electric oscillations that can exist in 
free, ungrounded antennas are not restricted to the frequencies deter- 
mined from the simplest case, i.e., when a half wavelength is impressed 
on the length of the antenna, although such half-wavelength dipoles 
are mainly used in UHF engineering. The following relationship 


exists between the length of the antenna and the wavelength: L=n->. 


(see p. 132). Thus, an antenna of Iength L can receive and radiate waves 
of wavelength A satisfying the above relationship. 
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In the field of radio, anumber of methods exist for varying the nat- 
ural frequencies of an antenna. Basically, they consist in the connec- 
tion of a self-inductance coil or a condenser to the antenna. By vary- 
ing the inductance or capacitance, the natural frequencies of the anten- 
na may be varied within broad limits. 


122. Radiation Pattern of a Dipole 


The radiation pattern of a dipole may be determined experimental- 
ly. It turns out that the results are in complete accord with the theory 
first advanced by Hertz. We shall be concerned only with the results 
of experiments and theoretical calculations, restricting ourselves to 
the field far removed from the dipole, i.e., to the so-called wave zone. 
This is the region in which the distances to the dipole are considerably 
greater than the dipole dimensions. 

Irrespective of the complexity of dipole oscillations, the oscilla- 
tions may always be resolved by means of Fourier’s theorem into 
their spectra, i.e., they may be represented as the sum of harmonic 
oscillations of frequencies @, 2, 30, etc. Therefore, it is quite suf- 
ficient to consider the electromagnetic field of a dipole whose moment 
varies in accordance with the harmonic relation p=p, cos of. 

Calculations and experiments show that the field of such a system 
may be represented by a spherical wave propagating with a velocity 


v= rm The electric and magnetic vectors of the wave are at right 





angles to each other and also at right angles to the direction of prop- 
agation. The latter circumstance, incidentally, follows from Poynt- 
ing’s theorem. 

In the wave zone, the electric and magnetic vectors vary in phase, 
performing harmonic oscillations at every point in space. A simple 
relationship exists between the numerical values of the field inten- 


sity vectors, namely: 2 7 
VeE=VudH. 


Hence, Poynting’s vector may be written in the following form: 


es eee 


Thus, the wave intensity, i.e., the energy passing through unit area 
per unit time, is proportional to the amplitude squared of the elec- 
tric field intensity. 

The radiation of a dipole is not the same in all directions. The am- 
plitude, as well as the intensity, depends on the angle of inclination 
of the propagation direction to the axis of the dipole. In the direc- 
tion perpendicular to the dipole axis the radiation is a maximum, 
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while in the direction of the dipole moment it is equal to zero. 
Theory gives us the following expression for the electric field intensity: 

Do®” R 

E= eR sin cos o (¢——) ; 

where the factor before the cosine is the wave amplitude of vector E. 
The angle @ is the angle between the propagation direction and the 
dipole axis. The expression for the magnetic field deviates from the 
above only with respect to a slight difference 

in the amplitude factor. 

Fig. 133 shows a diagram sometimes used 
to represent the dependence of radiation in- 
tensity on direction. Here, a radius vector is 
shown intersecting the radiation pattern. If 





a f 


Fig. 133 Fig. 134 


the scale is known, the radiation intensity is “given by the length 
of the vector measured to the point of intersection. 

The fact that the amplitude is proportional to the radiation frequency 
squared is very important. Clearly, the radiation intensity of the 
dipole depends very greatly on the frequency, i.e., it is proportional 
to the frequency to the fourth power: 

K~ Sy sin? 6. 
Thus, when the frequency is halved, the intensity decreases to is 
of its original value. 

Theory has led to an important conclusion regarding the orthogonal- 
ity of an electromagnetic wave. This is illustrated in Fig. 134, where 
it is seen that the electric and magnetic vectors are perpendicular 
to the direction of propagation. As a result, the properties of an elec- 
tromagnetic wave change when the wave is turned about the direction 
of propagation. This phenomenon is known as polarisation. 
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Mapping the flux lines of a radiating dipole is of no particular in- 
terest. Fig. 135 shows vectors of electric field intensity for several 
points in space. The field is rotational 

dS and the flux lines are closed. When 

: radiation occurs, the closed lines ex- 

pand in. the direction away from the 

ae radiator. The magnetic lines of force 

WwW consist of circles about the dipole axis. 

At great distances from the dipole, 

\\, the spherical wave practically does 

not differ from a plane wave. Of course, 

the orientation of the FE, H and K 


§ £ \\ tttttt vectors in a plane wave and the nu- 
merical relations given above remain 
the same. 


W 123. The Electromagnetic Spectrum 


a Mf According to theory, electromagnet- 
ic radiation occurs when electric 
charges are accelerated nonuniformly. 


a A uniform or free flow of electric 
. charge does not produce radiation. 
Fig. 135 Charges moving under the action of a 


constant force, e.g., charges describ- 
ing a circle in a magnetic field, also do not radiate. 

In oscillatory motion, the acceleration is continuously changing. 
Hence, electric charge oscillations produce electromagnetic radiation. 
Electromagneticradiation also occurs when charges are abruptly decel- 
erated. Thus, when a beam of electrons impinges on a target, X-rays 
are produced. Electromagnetic radiation also occurs during random 
thermal motion of particles (thermal radiation). The pulsations of a 
nuclear charge produce an electromagnetic radiation known as y-rays. 
Ultraviolet rays and visible light are produced by the motion of 
atomic electrons. Electric charge oscillation on a cosmic scale is 
exemplified by the radiation of radio waves by heavenly bodies. 

In addition to natural processes in which various kinds of electro- 
magnetic radiation are produced, a number of experimental means 
exist for creating electromagnetic radiation. 

The main characteristic of electromagnetic radiation is its frequency | 
(in the case of a harmonic oscillation) or its frequency band. Of course, 
using the relation c=vA, the length of the electromagnetic wave in 
vacuum may be determined if the radiation frequency is known. 

The radiation intensity is proportional to the frequency to the 
fourth power. Hence, very low-frequency radiation having wavelengths 
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of the order of hundreds of kilometres cannot be traced. Practically, 
the radio band. begins with wavelengths of the order of 1 or 2 km, 
which corresponds to frequencies of the order of 150 kc/s. Wavelengths 
of the order of 200 metres are in themedium-frequency band, while 
those of the order of tens of metres are in the short-wave band. Ultra- 
high frequencies (UHF) are beyond the usual radio frequencies; 
wavelengths of the order of several. metres and fractions of a metre, 
down to a centimetre (i.e., frequencies of the order of 10'°- 
-10** mc/s), are used in the television field and in radar. 

In 1924, Glagolyeva-Arkadyeva obtained even shorter electromag- 
netic waves. Electric sparks produced between iron fillings suspended 
in oil served as her source. In this manner, wavelengths down to 
0.1 mm were obtained. Thus, an overlap was achieved with thermal 
radiation wavelengths. 

Visible light occupies a very small band of wavelengths—from 
7.6 x 10-*cm to 4 x 107* cm. This band is followed by ultraviolet 
rays, which are invisible but very easily detected by means of labora- 
tory equipment: These wavelengths extend from 4 x 107° em to 
10° cm. 

Following the ultraviolet band is the X-ray band..The wavelengths 
in this band extend from 10-* cm to 107'°cm. The shorter the X-ray 
wavelength, the less the absorption by matter. Electromagnetic radia- 
tions of shortest wavelength, which are the most penetrating, are 
called y-rays (wavelengths of 10-* cm and less). 

The nature of any of the enumerated electromagnetic radiations 
may be completely determined as follows: First, the electromagnetic 
radiation is resolved into a spectrum by some method or other. In 
the case of light, ultraviolet rays and infrared radiation, this may be 
accomplished by refraction through a prism or by passing the radia- 
tion through a diffraction grating (see below). In the case of X-rays and 
y-rays, resolution into a spectrum is achieved by reflection from a crys- 
tal (see p. 381). The spectrum of radio waves is determined by making 
use of the phenomenon of resonance. 

The radiation spectrum obtained may be continuous or discrete, 
i.e., all frequencies may be present in a broad band of the radiation 
spectrum or the spectrum may consist of individual sharp lines cor- 
responding to very narrow bands of frequency. In the first case the 
spectrum is represented by a curve of intensity vs. frequency (or 
wavelength), while in the second case the spectrum is described by 
giving the frequency and intensity of the lines. 

’ Experiments show that electromagnetic radiation of given frequen- 
cy and intensity may not always have the same polarisation state. 
In addition to radiation in which the electric vector of the waves os- 
cillates along a specific line (linearly polarised waves), radiation also 
exists in, .which linearly polarised waves turned .relative to each 
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other about the beam axes are superimposed. Hence, to completely 
describe radiation, it is also necessary to indicate its polarisation. 

It should be noted that even for the slowest electromagnetic oscilla- 
tions, the electric and magnetic vectors of a wave cannot be meas- 
ured. The above descriptions of a field are based on theory. Neverthe- 
less, in view of the continuity and unity of all electromagnetic theory, 
there is no reason to doubt their veracity. 

The assertion that one or another kind of radiation consists of elec- 
tromagnetic waves is always based on indirect evidence. However, 
since these hypotheses have so many consequences that are in com- 
plete mutual agreement, the electromagnetic spectrum hypothesis 
long ago became accepted fact. 


124. Quantum Nature of Radiation 


We have already noted (p.154) that investigation of atomic phenome- 
ma has led to a law which states that the internal energy of a system 
cannot assume any arbitrary value, but is characterised instead by a 
system of energy levels. Energy radiation is related to the transition 
of a system from a higher level to a lower level. Energy absorption is 
related to the transition to a higher level. 

This applies, in the first place, to electromagnetic radiation. The 
quantum nature of submicroscopic phenomena was discovered at the 
beginning of this century as a result of investigation of a number of 
conflicting facts regarding electromagnetic radiation. 

Thus, the emission of electromagnetic radiation of frequency v 
by a dipole occurs in quanta (packets) rather than continuously. 
A quantum of energy is equal to Av, where / is Planck’s constant and 
is equal to 6.62 x 107° erg sec. 

The quantum nature of electromagnetic waves is manifested in 
absorption as well as radiation, for absorption too can only occur by 
means of energy quanta. If the value of an energy quantum is equal to 
the difference between certain energy levels of a system on which the 
wave impinges, the absorption process is quite pronounced. Such a 
process may be called resonance absorption. From the standpoint of 
classical physics, such absorption occurs when the frequency of the 
external field is equal to the oscillation frequency of the particles 
constituting the system. If the value of the electromagnetic wave 
quantum is less than the difference between energy levels, absorption 
cannot occur and the wave passes freely through the system. 

In quantum terms, the secondary radiation of a system is described 
as follows: A system absorbs a quantum of electromagnetic energy 
and is raised to a higher energy level. The system maintains this level 
for a certain period of time and then returns to its former energy level 
by giving up energy—again in the form of a quantum. 
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Since a quantum of energy is equal to Av, it is immediately evident 
that the higher the radiation frequency, the more pronounced the 
quantum phenomena. Nevertheless, the quantum nature of radiation 
has already been observed in practically all regions of the electromag- 
netic spectrum. It has even been possible to observe the quantum 
absorption of radio waves having wavelengths of several hundred 
metres (radiospectroscopy—p. 

The presence of one or another spectrum of electromagnetic radia- 
tion depends, in the first place, on the arrangement of energy levels 
in the system under consideration and on the transition probabilities 
of the system from an n-th level to an m-th level. If these probabili- 
ties were known beforehand and the energy level diagram were avail- 
able, it would be an easy task to determine the radiation spectrum 
of thesystem. 

We shall repeatedly be dealing with problems of radiation and ab- 
sorption of electromagnetic energy, but now let us consider some prob- 
lems of electromagnetic wave propagation in which the quantum na- 
ture of radiation is not manifested when the phenomenon is not accom- 
panied by the absorption and radiation of energy. 


CHAPTER XIX : 
PROPAGATION OF ELECTROMAGNETIC WAVES 


125. Dispersion and Absorption. 


In a homogeneous medium, the velocity of propagation and the 
direction of an electromagnetic wave do not change. The velocity 
of the wave is a maximum in-vacuum. In a medium, the wave veloc- 
ity is 

a c 

v= Ven ; 


and since in most practical cases »=1, we obtain 





pee 
ieee (gta 


The ratio of the velocity of wave propagation in vacuum to the 
velocity of propagation in a medium is called the index of refrac- 
tion. Thus, from electromagnetic theory we obtain the equationn = Ve, 
which is quite valid for very long wavelengths. As the wavelength 
changes, the index of refraction changes. This dispersion is alien to 
Maxwell’s electromagnetic theory, which regards a medium as a 
continuum and does not take into account the interaction of radia- 
tion and matter. Be that asit may, the equation n=Ve is not valid 
for rapid electromagnetic oscillations. 

When an electromagnetic wave is propagated through matter, the 
electric charges of the molecules are set into a vibratory state. Since 
an electron cloud moves freely as compared with heavy nuclei, 
electric oscillation consists in the displacement of the centre of 
gravity of the electrons relative to the stationary centre of gravity 
of the atomic nuclei’s positive charges. Designating the charge and 
mass of the oscillating electrons by e and m, respectively, the oscilla- 
tion equation may be written in the form 


mx = —kx—eE, cos ot. 


Or, dividing by m and using the formula for the natural frequency of 
oscillation, i.e., 9 = =, we obtain 


ss e 
X= 9X —— E, cos ot. 
m 
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We have equated the product of mass and acceleration to two forces— 
the restoring force —x and the external, periodically varying force 
eE, cos wt. This is the equation of forced harmonic oscillations. 
It is satisfied if 


X= x, cos wf. 


After substituting in the equation, we obtain 


e 
——E 
m 


x,= 





2 2° 
a,—® 


The dipole moment of a molecule is 
e 


0 2 


ex . 
2 
Q,—@ 





The polarisation vector, i.e., the dipole moment per unit volume, is 
N times larger, where NV is the number of molecules per unit volume: 


Ne? 
P=" EB, 


2 2 
O,—@ 








Recalling the formula relating the polarisation to the field inten- 
sity, i.e, ; 





e—] 
P= re E, 


it is seen that the permittivity of the medium has been expressed 
in terms of the parameters of the molecular dipole: 
, 4nNe* 


m 
a Ss alae ae 
QM) (0) 





The index of refraction of the medium should be equal to the 
square root of this expression. . 

As shown in Fig. 136, the nature of the dependence is confirmed by 
experiment. Here, the index of refraction vs. frequency curve, based 
-on the above formula, is compared with the curve based on measure- 
ments with specific substances.* What is the basic conclusion to be 
drawn from the experimental and theoretical results? In general, 
the index of refraction increases with increasing frequency in the 


* A more exact theory results in agreement with the experimental values 
in the region close to w, as well. 
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entire frequency range, except for the region in the immediate vicini- 
ty of resonance absorption. This: region is called the anomalous 
dispersion region. A substance may have several resonance frequen- 
cies, which correspond to the differences between its energy levels. 
Hence, there will be a corresponding number of anomalous disper- 
sion regions. 
Thus, the index of refraction of a wave, and hence the velocity 
of propagation, greatly depends on the value of the wave frequency 
relative to the natural fre- 
quencies of the molecular di- 
pans Theory poles. 
—— Fxperiment Naturally, the capacity of a 
substance to absorb an electro- 
magnetic wave depends on 
the same factors. Using the 
same reasoning as in the case 
of elastic waves (see p. 119), 
we arrive at a completely 
analogous formula: 


l=I,e~*4, 


which enables us to determine 
the value of the radiation in- 
Wo tensity / relative to the in- 

Fig. 136 cident intensity /, if the ab- 

sorption coefficient p and the 

layer thickness d through which the wave has passed are known. 
It should be recalled that the absorption coefficient is equal to 
the reciprocal of the layer thickness which decreases the radiation 


intensity to tof its original value. Due to the complex system of 


energy levels peculiar tomatter, the curve of the absorption coeffi- 
cient plotted against the frequency of the incident wave may appear 
odd and “erratic”. 

Until now we have been considering dielectric media containing 
only bound electric charges. Other relations exist when an electro- 
magnetic wave is propagated in a medium in which a considerable 
number of free electrons are present. Such media include metals and 
the ionosphere—a region of free charges akin to a gas. Using the theo- 
ry presented above, it must be assumed that in the formula for « 
the natural frequency «, of.a free charge is equal to zero (the frequen- 
cy is proportional to the rigidity of the bond). The dielectric constant 
is then given by the formula AmNe® 


m 
@? 









e=l1— 
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When @ becomes sufficiently large, the index of refraction n=Ve 
approaches unity. But when w*<4nNe*/m, the index of refraction is 
imaginary. This means that for the given values of frequency the 
waves cannot penetrate the metalor the ionosphere. On the other hand, 
for high frequencies the waves are “indifferent” to the presence 
of a medium containing electrons. These predictions are borne out in 
the case of radio waves. Thus, long and medium waves are reflected 
from the ionosphere and do not penetrate it, short waves penetrate 
the ionosphere, and uhf waves pass through the ionosphere unim- 
peded. 

The above presentation is greatly oversimplified. Hence, it should 
not be surprising that the conclusions are not valid for the optical 
region where the values of the index of refraction may be close to zero 
and also much greater than unity. 


126. Behaviour of an Electromagnetic Wave at 
the Boundary Between Two Media 


Just as in the case of an elastic wave, an electromagnetic wave is 
reflected and refracted at the boundary between two media. The basic 
laws of these phenomena may be subjected to theoretical analysis 
by utilising the boundary conditions on the electromagnetic field 
vectors. These conditions, discussed on pp. 256 and 288, follow in 
turn from Maxwell’s equations. Since the relationships between the 
fields on either side of the boundary are not arbitrary, the division 
of the wave into reflected and transmitted components is also not 
arbitrary. : 

The two fundamental relations may be expressed as follows: 
the tangential components of the electric and magnetic vectors on 
either side of a boundary 
must be equal. E 

What restrictions are 
imposed by these relations 
in the simple case of nor- 
mal incidence? This case is | 
illustrated in Fig. 137. As- 
suming that the electric 
vectors are in the plane of 
the page, then the magnetic Fig. 137 
vectors are perpendicular 
to this plane. We know that the electric and magnetic vectors 
and the direction of propagation may be viewed as a right-handed 
screw system, i.e., the rotation of vector E by the shortest path 
toward vector H appears counterclockwise to one facing the on- 
coming wave. To satisfy this requirement of electromagnetic theory, 
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the direction of either vector H or vector E must be reversed for 
the reflected wave. Thus, either the magnetic or the electric vector 
undergoes a 180° phase shift when the wave is reflected. 

When considering oblique incidence, it is necessary to determine 
which of the two actually occurs. It turns out that both are possible— 
one when the wave passes into a medium of larger ¢ and the other when 
it passes into a medium of smaller e. 

For normal incidence, the following calculations do not depend on 
the scheme chosen. Let us write the boundary conditions in the forin 


Eincia > Ey ict oe Eepeiih 
and 
A incia Pas Hye pect + pe pract: 

But the following relationship exists between the numerical values of 

the H and E vectors: 
H=VeE=nE. 
Hence, we obtain two equations, 
Eincia = Erin a Bae 
and 
NE incia = — NE re ptect + Nee re sracts 
whence the ratios Ey pec/Eincia Ad Eyetract/Eincia May be deter- 
mined. Since the wave intensity is proportional to the amplitude 


squared and the index of refraction (p. 322), we obtain for the coeffi- 
cients of reflection and transmission the following simple formulas, 


n . ‘a . 
where n= os is the relative index: 
iy 1 





2 
coefficient of reflection = ( Speltec) = 
. incid : dn 
sffied feat ot refract \ __ 
coefficient of transmission = ( pesca ) =e 

The similarity to the case of elastic waves (cf. p. 126) is very great. 

_ By means of such calculations, the general results presented below 

were obtained for the case of arbitrary beam inclination and polari- 

sation state of the wave. The agreement with experimental results is 
quite satisfactory. 

Since the sum of the reflection and transmission coefficients is 
equal to unity, the theoretical results are completely described by 
Fig. 138, where the intensity of the reflected wave is plotted as a 
function of the angle of incidence. 

Calculations and experiments show that the nature of the reflection 
curve depends to a significant extent on the polarisation state of the 
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incident wave relative to the plane of incidence. The electric field 
intensity vector E is “more important” than the H vector, if only in 
the sense that the photochemical action is due to E. Therefore, in 
describing the polarisation state of a wave, it is customary to specify 
it with respect to the electric vector. The orientation of the H vector 
is always easily found if the direction of propagation is known. Thus, 
it turns out that the reflection coefficient is different for two waves 
that are incident at one and the 
same angle mon the same boundary 
if in one case the electric vector is 
in the plane of incidence and in 
the other it is perpendicular to this 
plane. In the figure, curve / cor- 
responds to the case when the E 
vector is perpendicular to the plane 
of incidence, curve /// corresponds 
to the case when the £ vector is in 
the plane of incidence and curve // 
corresponds to the case when the 
wave is not polarised 


Coan 
In the first case, the change in P| tL peu AV I 


(toe domed eldchee neeen Eee 
Baw! 


little reflection, the coefficient 





being of the order of 5%; then, 20° 40° 60° TK 
with increasing angle the reflection 
coefficient increases, ever more rap- Fig. 138 


idly, until the glancing angle is 
reached. A beam whose electric vector is in the plane of incidence 
behaves in an entirely different manner. Its reflection intensity de- 
creases with increasing angle until it reaches zero at the angle @,. 
This angle is determined by the following interesting equation: 
n=tan gz. The figure is plotted for the value n=1.52 (transition 
from air to glass). Hence, the angle at which the reflection coefficient 
decreases to zero is equal to 56°40’. For a further increase in angle, 
the coefficient of reflection begins to increase and finally reaches 
unity. 

What is the reason for the absence of reflection in this particular, 
case? How does this case differ from others? Evidently, the answer 
must be sought in the boundary conditions, from which the entire 
theory of the phenomenon proceeds. We leave it to the reader to 
construct the field vectors for this angle and illustrate the require- 
ment. 

The following question may arise in the reader’s mind: If the 
boundary conditions enable us to understand all phenomena at the 
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boundary between two media, then what about total internal reflection 
where there is a field in one medium (E,,=40) but none in the other? 
The question is perfectly valid and the theory provides an answer. 
It turns out that under conditions of total internal reflection the field 
penetrates the second medium but is not propagated deeply into the 
medium. The condition E,,=E,, is not violated. 

A number of experiments have been devised for the demonstration 
of light wave penetration into a second medium under conditions of 
total internal reflection. Suffice it to recall the basically simple ex- 
periment proposed by Mandelshtam. A glass prism is'partially immersed 
in a solution of fluorescein—a substance exhibiting a characteris- 
tic fluorescence under the action of light. Then, a beam of light is 
directed onto the prism in such a manner that total reflection occurs 
on the inner side of the prism surface that is immersed in the solu- 
tion. The fluorescein thereupon glows intensely in an extremely thin 
layer next to the glass, proving that the electromagnetic wave has 
penetrated the solution. 


127. Natural and Polarised Light. Polarisation 
upon Reflection 


Place a glass plate at an angle @, to a light beam. The beam is 
reflected. Then, if the beam is turned about its axis (actually, the 
source of light is turned about the beam axis), it might be expected 
that at some position the beam will not be reflected. But if natural 
light is used for the experiment, this does not occur, i.e., for every 
azimuthal position of the incident beam, the reflected beam has the 
same intensity. It would be wrong to consider this a refutation of the 
theory presented in the preceding article. This experiment merely 
shows that the polarised state of a beam of natural light is more 
complex than given by the scheme of two vectors, E and H, having 
fixed directions of oscillation. 

Now, let the above beam, reflected at an angle g,, impinge on a 
second plate placed at a similar angle p, to the beam reflected from 
the-first plate. Then, turn the beam about its axis. Since, of course, 
only the relative position of the beam and the reflector is of impor- 
tance, it is easier to turn the second glass plate. Investigation of this 
double reflection shows that the reflection varies with the position, 
and the position for which no reflection occurs is easily found. It is 
evident that this position corresponds to a mutual orientation of 
beam and reflector for which the electric vector of the beam is in 
the plane of incidence. The following conclusion may be drawn: 
reflection from the first reflector results in the natural beam acquir- 
ing a polarised state in which a singie oscillation direction of the 
electric vector is separated out. 
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In contradistinction to natural beams, beams in which the vectors 
have a specific oscillation direction are referred to as polarised beams. 
How should the polarised state of a natural beam be envisaged? 
It is necessary to assume that in a natural electromagnetic wave all 
possible oscillation directions of the electric vector are uniformly 
present. The word “possible” should be underlined since electromagnet- 
ic theory shows that the electric vector is of a transverse nature. 
In essence, therefore, a natural unpolarised wave is a superposition of 
numerous linearly polarised waves having a uniform distribution as 
regards the vector oscillation directions. All transverse directions 
are electric vector oscillation directions of a beam of natural light. 

Reflection from two successive reflectors, fixed at an angle g, to 
the beams, is one method of polarising beams of light. 

An electric vector of natural light may always be resolved into two 
mutually perpendicular components. When reflection is being inves- 
tigated, it is most convenient to resolve each vector into two compo- 
nents—one in the plane of incidence and the other perpendicular to 
this plane. Thus, the behaviour of a natural beam may be equated to 
the behaviour of two such component waves, if we take into account 
that the phase difference between them varies randomly. Therefore, 
in describing the polarisation of light, we say that oneof the compo- 
nents has not been transmitted, or has been transmitted to such 
and such an extent. If upon reflection or refraction one of the compo- 
nents of light is transmitted to a larger extent than the other— 
which the reflection curves show to be the case—this signifies that the 
light has been partially polarised. 

We can utilise this phenomenon to obtain total polarisation of a 
beam. Instead of using two reflectors fixed at an angle pz to the beams, 
it is much easier to transmit a beam through a pack of glass plates. 
Each refraction will increase the share of one of the components in 
the beam by a certain percentage. In this manner almost total pola- 
risation may be achieved. 

The natural state of a light beam is unpolarised. However, this 
does not mean that every beam that has not been subjected to reflec- 
tion or refraction is unpolarised. This applies particularly to radio 
waves. The short electromagnetic waves used in the transmission of 
television are highly polarised. It is precisely this circumstance that 
enables us to determine the direction of the transmitter by the orien- 
tation of the receiving antenna. The electromagnetic waves which act 
as the carrier of a television programme are highly polarised. Hence, 
the antenna must be oriented in such a manner that the oscillation 
direction of the electric vector coincides with the antenna direction. 
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128. Propagation of Light Waves in a Medium Having 
a Refractive Index Gradient 


As arule, a difference in density is associated with a difference in 
refractive index. The natural question arises: What is the nature of 
the wave propagation in a medium in which the value of the refrac- 
tive index varies from point to point, i.e., the refractive index gradi- 
ent differs from zero? 

A difference in refractive index signifies a difference in the velocity 
with which the wave front advances. It follows, therefore, that the 
wave front will be continuously deformed as it advances in such a me- 
dium. If we construct the normals to the wave front, we obtain a 
curved line. Thus, itcan be stated that in anonhomogeneous medium 
light is propagated curvilinearly rather than rectilinearly. 

The analogous problem for -sound waves was discussed earlier 

(p. 146). The same laws are applicable here and the beam path is also 
determined by Fermat’s principle. A beam of light propagated in a 
finite medium having a refractive index gradient follows a path be- 
tween two points that requires aminimum amount of time to traverse. 
Therefore, the beam of light bends so as to shorten its path in regions 
where the refractive index is large and lengthen its path in regions 
where the refractive index is small. 
- The best example of the propagation of light in a medium of gradi- 
ent n is the passage of a beam of light through the Earth’s atmosphere. 
Since the density and the index of refraction of air decrease with 
increasing elevation, it follows that refraction occurs in the atmos- 
phere. A beam travelling from a star to the Earth and entering the 
atmosphere at an angle rather than along a radius will bend; hence, 
the apparent position of the star is displaced relative to its true posi- 
tion. For a star at the zenith, the displacement angle is as much as 
*/, of a degree. 

Mirages are caused by the presence of a refractive index gradient 
in the atmosphere. They occur in the African deserts due to the fact 
that heat currents are easily formed above the hot sand, resulting in 
temperature gradients and, hence, density and refractive index grad- 
ients. As a result, the light beams travel along curved lines-and a 
landscape seems to appear where the observer, accustomed to the rec- 
tilinear propagation of light, conceives it. 

Of course, in the case of light propagation in a nonhomogeneous 
medium, the waves are neither spherical nor planar. It should be re- 
called that a variable propagation velocity signifies that the wave- 
length also varies from point to point. What is the-equation. of wave 
motion in a medium where the refractive index changes from point 
to point? Since the parameters of the wave change from point to 
point, the equation being sought must be a differential equation, for 
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only a differential equation can express the dependence between the 
physical quantities for a given point in space. 

This equation may be found by means of Maxwell’s equations. 
Since the derivation is rather complex, it will not be presented here. 
The calculations yield the following relation, which is valid for the 
E vector (or its projection) as well as the H vector (or its projection): 

ey 1 ay 
“Os! ®t? * 


The function V is called the wave function. It represents the E 
vector, the H vector, or their components since the equations are the 
same for each case. Here, s is the coordinate in the direction of wave 
propagation, ¢ is the time and v is the propagation velocity. 

This equation is called the wave equation and it is valid for all 

points in space located outside the sources of the field, i.e., outside 
charged regions and regions in which electric currents flow. 
_ First it will be shown that the above differential equation is satis- 
fied by the simplest wave process, i.e., a plane wave. As we know 
(p. 113), the expression for a plane wave of frequency o, propagating 
in the direction s, has the form 


B— Acoso (4 —<) : 


Let us determine the second derivative of the wave function W with 
respect to time and with respect to the coordinate. We obtain 


2 2.1) 2 


ot? os? 


It is seen that the following relationship must exist between the sec- 


ond derivatives: 
OY: 2 Net), 
“oot ot” 
hence, the equation of a plane wave is embodied in the proposed 
differential equation. However, the above differential equation em- 
braces much more. Any function of argument (4 —=) is a solution of 


the equation since for any function ¥ (4 —=) the derivative expres- 
sions in Y are the same. 
The dependence of a function on the argument (¢ —F)is regarded 


as the sole indication of a wave process. The significance of this argu- 
ment consists in the following: If the state at a point s=0 is character- 
ised at the instant of time ¢=0 by a certain value of the wave func- 
tion, then the same state occurs at point s, at the instant of time 


12-3580 
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t,=-—+, at point s, at the instant of time ¢,= = ete. Heres is 
z U U 


the coordinate reading along any rectilinear or curvilinear path. . 
The differential equation 
: ry 1 &y 


ae CL? 
is the'general equation of a wave process and is valid for any medium, 
including nonhomogeneous medium where v varies from point to 
point. 

If it is necessary to express the wave function in terms of the three 
space coordinates x, y, z, the generalised wave equation has the fol- 
lowing form: 

oy ey , ey 1 oy 
Ox? oy | 
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The sum of the second partial derivatives of a function is concisely 
designated by the symbol AW (read: Laplacian of Y). Thus, 


1 ey 
A Fat 

The differential equation of a wave is valid for any process in which 
the value of the wavelength and the amplitude of the wave vary from 
point to point. 

Let us designate the amplitude of the wave function WY by ». 
For most problems, it is primarily p that interests us. If a vibratory 
process of frequency o occurs in a region, then, in the most general 


case: 
oy 


OE = of, 


Therefore, a wave function will always satisfy the equation 


@? 
wy 
AY = ——, B, 


The part of the expression for V that is a function of time always 
cancels in such an equation. Hence, the last equation is the equation 


for the wave amplitude p. By means of therelationA =v zal it may 
also be written in the form , 


Ap +5r v=o. 


Sometimes this equation too is called the wave equation. 
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129. Propagation of Radio Waves 


Radio waves are propagated in accordance with the laws of reflec- 
tion and refraction. In order to obtain the concrete results required 
for our discussion, it is merely necessary to generalise the theory to 
the case of a medium having a continuously varying coefficient of 
refraction. But this has already been done for elastic waves (see p. 145) 
and is completely applicable for electromagnetic waves, i.e., for 
light as well as radio waves. A wave travelling in a medium of vari- 
able n, i.e., a wave travelling with variable velocity, is propagated 
in such a manner that the least amount of time is taken to traverse 
the distance between two points. The path of the wave will be curvi- 
linear and in passing from one layer of the medium to another layer 
where n is greater the wave will be deflected toward the normal to 
the boundary. 

In order to determine the nature of the radio-wave propagation, 
the electrical properties of the Earth and the atmosphere must be 
known. The electromagnetic field of the wave is greatly affected by 
the magnitudes of the electrical conductivity and the dielectric 
constant of these two media. 

How is the difference in behaviour of electromagnetic waves of 
diflerent length explained? Of course, a significant role is played by 
dispersion. But an approximate indication of the behaviour of an 
electromagnetic wave may be obtained from an examination of the 
relationship between the displacement current and the conduction 
current. It is evident that a medium exhibits dielectric properties 
when the displacement current is much greater than the conduction 
current. On the other hand, if the displacement current is neg- 
ligible, the medium may be considered to be a conductor. 

The properties of the Earth’s surface and the properties of the 
atmosphere must be examined from this standpoint. 

Let us take a typical example. Experience in the field of radio engi- 
neering has shown that a flat terrain covered with trees may be char- 
acterised by a dielectric constant e of the order of 12 and a specific 
electric conductivity y of 7 x 10’ (in the Gaussian system). To study 
the propagation of waves over the surface of a sea, it is important 
to know the values of e and y for sea water. These values are 80 and 
10'°, respectively. The ratio of the conduction current density to the 
displacement current density (see p. 315 for the required formulas) is* 
given by the formula : 


teond 2 YR 9 7519-10 YR 
idispt = & ; @ 

in the Gaussian system of units. For long waves, say 2,000 metres, 

this ratio is equal to 77 for a wooded area and to 1,600 for asea surface. 


12* 
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The medium in each case, but especially in the latter, may be consid- 
ered {o be a good conductor. For short waves, say 20 metres, the 
first value decreases to 0.77 and the second to 16. This means that 
for short waves sea water continues to be basically a conducting medi- 
um, while a wooded area acts to a considerable extent as a dielectric. 

Waves propagating over a conducting surface “cling” to this sur- 
face. The electric flux lines approach the Earth at right angles and 
travel along the terrestrial surface. That is why an electromagnetic 
wave can easily travel around the globe. (It takes 0.13 sec to do this 

-and since this time can be measured quite accurately we can deter- 
mine the propagation velocity of radio waves.) This applies to long 
waves. Short waves cling only to sea surfaces. In other regions, they 
behave like perfectly free waves. When such a wave travels along 
the surface of the globe, it penetrates the Earth and is absorbed. 
Moreover, the higher the oscillation frequencies, the greater the ab- 
sorption. 

A number of remarkable features in the behaviour of radio waves 
is explained by the presence in the upper layers of the atmosphere of 
a layer containing a large number of free ions and electrons. This 
layer is known at the ionosphere. Thus, the region in which an elec- 
tromagnetic wave travels may be roughly pictured as a dielectric 
bounded by two conducting layers. 

Ionisation of the atmosphere is not uniform, i.e., the number of 
free charges per unit volume varies from one layer to the next. As 





Fig. 139 


was seen in Sec. 125, the coefficient of refraction decreases as the num- 
ber of charges increases. Since the coefficient of refraction of a con- 
ducting medium is less than unity, a wave entering the ionosphere 
at an angle from a dielectric medium is deflected from the normal. 
The ionisation increases; hence, the deflection increases with each 
succeeding layer. 

Furthermore, as Fig. 139 shows, a wave may either pass through 
the ionosphere and recede from the Earth or, after being bent more 
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and more, return to the Earth. Roughly speaking—disregarding 
the nonuniformity of the ionosphere—a wave returns to the Earth 
if it strikes the ionosphere at an angle greater than the total internal 


teflection angle: 
{Aa Nl 2, 
sin), >n = V 1— ane ‘ 





For smaller angles, the wave is propagated into outer space. By repeat- 
edly being reflected from the ionosphere and the terrestrial surface, 
short waves can round the globe, experiencing considerably less 
energy losses than in the case of long waves. 

Since uhf waves can pass through a layer of free charges, they are 
not reflected from the ionosphere. Therefore, radio reception on 
uhf is possible only along the line of sight. 

The above picture of the atmosphere is greatly oversimplified. In- 
vestigations have shown that the density distribution of free electric 
charges in the atmosphere is characterised by several maxima since 
the ionosphere is composed of several layers. The stability of these 
layers differs, depending on the time of year. It is interesting that 
the existence of the layers is related to solar activity. Thus, vari- 
ations in the state of the ionosphere corresponding to the 11-year 
sun-spot cycle may be observed. Ionisation of the upper layers of the 
atmosphere is undoubtedly related to the arrival of cosmic radia- 
tion on the Earth. 

From a study of the electrical properties of the ionosphere and 
the surface of the Earth, radio engineers have drawn a number of 
conclusions regarding the most favourable conditions for radio trans- 
mission and reception on waves of various lengths. However, we 
shall not go into this subject. 


130. Radar 


A radar station consists of transmitting and receiving equipment. 
Every ten-thousandth of a second (A in Fig. 140), the transmitter 
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Fig. 140 


sends a pulse of duration a (of the order of several microseconds) into 
space. If an object capable of reflecting the wave is intercepted in 
the solid angle “illuminated” by the radio waves, a part of the wave is 
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reflected back to the radar station. The reflected signal is received 
oak sec after the pulse is transmitted into space. 


This time may be measured by means of an oscilloscope. The sweep 
of the electron beam is synchronised with the transmitted pulses, 
and the demodulated signal from the receiver is fed to the second pair 
of oscilloscope plates. As a result, a “pip” displaced relative to the 
initial point of the sweep by a distance proportional to the time t 
appears on the oscilloscope screen. If the object intercepted by the 
radar pulse is stationary, the “pip” on the oscilloscope screen will 
also be stationary. Thus, synchronisation is achieved when the sweep 
time equals one ten-thousandth of-a second, the interval of time be- 
tween successive transmitted pulses. If the object “seen” by the radar 
is moving, then the pip on the oscilloscope screen also moves. 

Modern radar systems are much more complex than indicated by 
this simplified picture. The motion of the electron beam of an oscillo- 
scope from the centre to the edge of the screen is more complex than 
motion along a radius. While moving outward along the radius, the 
electron beam slowly rotates about the centre of the screen like the 
hand of a clock. This rotation is synchronised with the rotation of the 
radar antenna in such a manner that the illuminated line points in 
the same direction as the transmitted radio beam. In addition, the 
following important change in the operation of the oscilloscope is 
introduced: If the radio beam does not encounter an obstacle, and 
hence the receiver does not pick up a reflected signal, the oscilloscope 
screen remains dark. On the other hand, if a pulse is received, a 
spot on the screen is illuminated. 

Thus, when a beam scans the horizon and encounters a body, this 
body is indicated on the oscilloscope screen by an illuminated spot. 
The distance of this spot from the centre of the screen is proportional 
to the distance of the radar from the object, and the azimuthal angle 
indicates the direction of the object. 

Oscilloscope screens possess an afterglow; hence, an illuminated 
spot does not disappear while the radar is scanning the area, i.e., 
before returning to the same position. If the illuminated spot is due 
to the beam reflected from a fixed object, the image on the oscillo- 
scope screen is also fixed. If the object moves, a moving image will 
appear on the screen. 

Due to the difference in the reflection coefficients of various ob- 
jects, a characteristic picture of the region is depicted on the screen 
of aradar system with circular scanning. Rivers and lakes appear dark 
(little reflection), the Earth appears lighter and woods still lighter. 
Of course, metal objects are “seen” very clearly. 

The nature of the visibility varies in accordance with the wave- 
lengths used. Thus, for radio waves in the centimetre range, clouds 
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are seen very clearly. Since longer waves are insensitive to clouds 
and rain, radar systems operating on such wavelengths can be used 
in all kinds of weather if they are not intended for the specific pur- 
pose of detecting clouds. 

Radar principles find broad application in science and engineering. 
Thanks to radar, pilots experience no difficulty in conducting night 
flights and in landing on airports which are not illuminated. Radar 
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Fig. 141 


is of great importance in meteorology. In addition to enabling us to 
detect rain and storm clouds at great distances or at night, which 
is essential for weather-forecasting, radar can be used to track meteor- 
ological balloons. Radar equipment installed on ships is a great aid 
to navigation safety, reducing to nil the possibility of accidental 
collisions of a vessel with other vessels or obstacles. In the field 
of astronomy, radar methods are used to determine the distance to 
meteors, and the direction and velocity of their flight. The waves are 
reflected, in the main, from the meteorite “trails”, which consist of 
ionised gases. The Moon, the Sun and the planets are all within the 
reach of radar. Radar astronomy is of great practical importance. It 
has enabled us to develop navigational instruments to determine the 
. location of a ship, from observations made on heavenly bodies, in 
any kind of weather and at any time of day or night. 

A vast amount of the literature is devoted to radar problems. 
Since such problems rightly belong to the field of radio engineering 
and not physics, we have restricted ourselves to an elaboration *of 
the principles of this remarkable development. 

Fig. 141 shows a block diagram of a radar system. 


CHAPTER XX 
INTERFERENCE PHENOMENA 


131. Addition of Waves from Two Sources 


First, let us consider two ideal sources radiating spherical waves. 
Assume both sources are oscillating synchronously. Im this case, for. 
waves of any type, a characteristic field is created in which bright 
and dark “fringes” appear where the waves reinforce and cancel each 
other, respectively. This phenomenon is most easily demonstrated 
by means of water waves. 

The mathematical calculations are straightforward. Take any 
point a distance r, from one wave source and r, from another. Then, 
maximum reinforcement of the waves occurs when the path difference 
r,—rz, equals a whole number of wavelengths, nd. On the other hand, 
the waves annul each other when the path difference equals an odd 


number of half-wavelengths (2n+1)4%. 


We know from analytical geometry that a 
curved surface all of whose points satisfy the 
following condition is a hyperboloid: the 
difference between the distances to two foci is 
a constant. In Fig. 59 (p. 122) a plane is drawn 
through the wave sources. Shown in this 
plane are hyperbolas—loci for which the 
difference between the distances to the wave 
sources is a constant. 

Now, let us consider the pattern obtained 
in a wave field on a cylindrical screen whose 
axis passes through the radiators as shown in 
Fig. 142. (We shall assume throughout that 
the radiators are point sources.) This inter- 
ference pattern will consist of alternating 
bright and dark horizontal lines since the 
conditions are exactly the same for all points 
on the cylinder located at the same height, 
i.e., relative to the radiation sources all such points are in the 
same state. A bright band will appear along a line around the mid- 
dle of the cylinder; since the distance from both sources is the 
same, the waves reinforce each other. For points at a height z 
above the mid-line, the difference in the paths traversed by 


the rays, r,—re, will be represented by ces But n—r, =2lz, 


Mls 
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where / is the distance between the sources. Thus, the condition 
for the n-th bright fringe has the form 





2lz 
nth, sae 
If the radiators are far from the screen, then for fringes-close to 
the centre, 


r,+1r,=2R, 


where R is the radius of the cylinder. The bright bands pass through 
the points z satisfying the condition 


lz 
pan. 


* ‘ : : d 
The distance between adjacent fringes is Az=R , 


Example. If two coherent sources (see Sec, 132) separated by a distance 
1=1 mm radiate light of wavelength 1=6,000A, then the distance between 
interference fringes on the surface of a cylinder of radius R=1 metre is 


Az =k = 0.6mm. 


If the light source radiates waves of various wavelengths, the 
interference pattern will be coloured since the maximum conditions 
differ for different values of 4. 

In addition to determining the positions of maximum and 
minimum interference, it is also of interest to determine 
the form of the intensity curve across the fringes. 


Since < is the path difference be- 


tween the waves, then 
6a 2m I 
i. R 
is the phase difference, and the total 
amplitude at any point is given by 
Acos wt + A cos {wi + 8). Z 


For equal amplitudes, we obtain the 


expression derived on p. 102: 
2A cos cos ( ot -L 5) : | | | 


The measured intensity (wave am- 
plitude squared) is equal to the average 
value of this expression taken over : 
the oscillation period. Fig. 143 
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[cos" (ot + >) 7 =+ 


(see the following article regarding calculation of the average), then 


Since 


= 2 2 niz 
I = 2A?’ cos aR * 





The intensity curve may be plotted as a function of the vertical coor- 
dinate z (Fig. 148). 


132. Coherence 


The superposition of two waves, described in the preceding section, 
may be physically achieved by various means and for various wave- 
lengths. For example, two antennas radiating radio waves may be 
placed close together, two electric bulbs with point filaments may be 
placed close together, or an incident ray and the same ray reflected 
from a mirror may be brought together. Experiments show that by no 
means does interference occur in all cases. This phenomenon can best 
be investigated by examining the superposition of the fields of two 
antennas. It is easily shown that an interference pattern is obtained 
only when a phase difference that remains constant during the time 
of observation exists between the superimposed waves. In this case 
the oscillations are said to be coherent. 

If the phase difference is fixed, the amplitude of the electromagnet- 
ic oscillations at a given point in space is constant. Thus, amaximum 
point remains a maximum and a point where the waves completely 
cancel each other always has zero intensity. 

When the phase difference varies randomly, the pattern is complete- 
ly different. During a certain interval of time the amplitude of 
the oscillations at a given point is a maximum, in the succeeding 
interval it assumes intermediate values, and then for an interval of 
time the waves cancel each other. If the duration of these intervals 
were commensurable with the practical capabilities of instruments, 
a fluctuating interference pattern could be detected. If the variations 
in phase difference are so rapid as to preclude detection by these 
instruments, the interference pattern is not revealed and the average 
value of the intensity is shown on the instruments. In such cases, 
we say that the oscillations are noncoherent. 

What is the expression for the average intensity in a region where 
fields are superimposed? This is easily determined. 

The amplitude of the total wave at a given point and at a given 

instant may be expressed in the form 


A, cos wt -+ A, cos (wt + 8). 
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The instantaneous intensity is proportional to this expression 
squared, i.e., it is equal to 


Aj cos? wt +- A, cos” (wt + 5)-++ 2A,A, cos wt x cos (wt +4). - 


We are interested in the time-averaged intensity of the radiation, 

i.e, 

1 = Aj (cos’ at), + Az [cos? (wt + 8)]g,-+2A,A, [cos wt X cos (wt + 6)]o5- 
The average values of trigonometric quantities are encountered quite often 

in physics. It is therefore useful to recall that the average values of sin x and 

cos x are equal to zero, and the average values of sin?x and cos*x are equal to '/,, 


if the argument x of the trigonometric function assumes all values with equal 
probability. The average value of a function f(x) is by definition 


If (X)iop = Lett eine with (en) 


This formula can be used to calculate the average if the variable x assumes 
discrete values. But if the variable x is continuous and assumes all values in 
the interval from a to b, the formula for calculating the average value is ob- 
tained in the following manner. Divide the interval (6 — a) into n segments 
of length Ax. Multiplying the numerator and denominator by Ax, we obtain 


__ f(xy) Ax+f (x2) Ax+... 


If (Dao nae 


Going over to the limit, this takes the form 
b 
1 
lf Mlao= phe J F (x) ae. 
a 


By means of this formula, we can calculate the average value of any func- 
tion of a continuously varying random quantity. In calculating the average 
value of a periodic function, a single period should be used in determining the 
limits of integration, for the average value of one period is clearly equal to the 
average value of any number of periods. Thus, for example, 


[cos? Jao = iy cos? xdx=y . 
Let us write the formula for the intensity in the form 


I = A} (cos? wt), + Az [cos? (wt + 8)],9 + A,A, [cos (2af + 8)],,-+ 
+ A,A, (cos 8),,. 


Using our knowledge regarding the average values of cos x and cos’x, 
we obtain: for a phase difference between two waves varying randomly, 
i.e., for noncoherent oscillations, 


=x (Ai +A); 
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on the other hand, if the phase difference is fixed, i.e., if the oscilla- 
tions are coherent, 


I= + (Ai + Ad) + A,A, cos 6 


or, for equal amplitudes, 
I = 2A? cos’ 4 ‘ 


This last formula is the same as the interference formula derived in 
the preceding article. 

Radio waves radiated by neighbouring antennas may be made 
coherent or noncoherent by technical means. 

As regards oscillations of light waves, at first glance it would seem 
that it is impossible to achieve coherence since the radiations of 
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Fig. 144 


individual atoms have no definite relationship to one another. The 
phases of the waves emitted by individual atoms are haphazardly 
distributed. It is quite natural that two light sources, no matter how 
closely they approximate point sources, do not yield an interference 
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pattern. Nevertheless, coherent light oscillations do exist. They occur 
for rays “taken” from one and the same light wave. 

Methods of artificially securing coherent sources are shown in 
Fig. 144. In one case two mirrors that are slightly inclined relative 
to each other are used and in the other a double prism (biprism) is 
used to produce wave sources from two virtual centres. Interference 
fringes may be observed on a screen placed anywhere in the inter- 
ference field. The theoretical discussion presented above is entirely 
applicable to these cases; the 
nature of the fringes is deter- 
mined by the distance between the 
virtual images of the light source 
and the distances from these 
images to the point of observa- 
tion. 

It is quite clear why the two 
parts of the “divided” ray are 
coherent. Between any pair of 
atoms of a real source, there is 
no coherent relationship. How- 
ever, by dividing sucha ray into 
two parts, we enable the radia- 
tion of each atom to interfere 
with itself. 

Thus, consideration of various 
cases of light interference is re- 
duced to the investigation of various cases of division of a flight 
ray by reflection and refraction, and the successive superposition 
of the components of the divided wave in the region of the 
interference field. 

The size of the light source significantly affects the coherence of 
the “divided” ray. Let us assume that the light source is of length 6 
and that rays in the solid angle 2u take part in the creation of the 
interference field (Fig. 145). Rays such as J and /’ emanate from 
a single atom and are therefore coherent. The same is true of 2 and 
2’. Consider the interference due to the superposition of the fields 
of rays /, 2and /’, 2’. For interference to occur, ‘the fields of the co- 
herent beams /, /’ and 2, 2’ must reinforce each other. The path dif- 
ference A =6 sin u, which exists between 7’ and 2’ as well as between 
1 and 2, tends to prevent this. Moreover, interference becomes pos- 


sible only when 6 sinuw< us 
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133. Interference in a Plate 


Let us investigate reflection and refraction of light incident on a 
flat plate of thickness d (Fig. 146). 

Assume a plane wave impinges on the plate at an angle 7. The 
beam of lighi is reflected and refracted. Moreover, the refracted beam 
strikes the lower surface of the plate and is also reflected and re- 
fracted. As aresult, there arise numerous rays parallel to the primary 
reflected beam, and also numerous parallel rays transmitted in the 
second medium. All these 
rays are coherent and a 
phase difference exists be- 
tween them. Hence, the con- 
ditions are present for inter- 
ference in the reflected as 
well as in the transmitted 
rays. 

As is well known (see 
p.332), the reflection coeffi- 
cient is not very large—at 
least, for normal incidence. 

Fig. 146 In this case, the intensity 
of each “succeeding” ray is 
much less than the intensity of the preceding one. For example, 
for a reflection coefficient of 5%, the first reflected ray has an 
intensity of 0.05 J,. The second reflected ray undergoes two refrac- 
tions and one reflection. Its intensity is 0.95x0.95x0.05 J, = 
=0.045 /,. Thus, the intensities of the first two rays are practically the 
same. But the third ray is much weaker since it undergoes three 
reflections and two refractions. Its intensity is equal to 0.95 x0.95 x 
x0.05 X0.05 x0.05 /,, i.e., one four-hundredth of the intensity of 
the preceding ray. 

Under conditions of small reflection coefficient, the phenomenon 
reduces to the observation of the interference of the first two 
rays. 

As regards the transmitted rays, under conditions of small reflec- 
tion coefficient the interference is not noticeable since the intensity 
of the second ray is one four-hundredth of the first, the intensity of 
the third is one four-hundredth of the second, etc. However, it is 
not very difficult to set up the experiment in such a manner that in 
the reflected as well as in the transmitted beam numerous interfer- 
ence rays occur. 

If a monochromatic wave impinges on a flat plate, the interference 
pattern is determined by the phase difference between the first and 
second reflected rays, 
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From the wave formula 
Acos@ (4 ~=) : 
v 


it is evident that the phase of the wave, traversing the path x with 
velocity v, changes by o- » or = x, where A is the wavelength in the 
medium. Designating the wavelength in vacuum by A, and recalling 
a 
be written in the form= nx. The product nx is often called the opti- 
cal path of the wave. If awave passes through several media, its phase 
changes by “tS, where S =(n,x,+n2x.+...) is the optical path. 

The phase difference6 between the interfering waves, which deter- 
mines the intensity of the resultant wave, is given by 


Qn 
=a 


that the refractive index n is equal to —°, the change in phase may 


i.e., it is determined by the difference between S’ and S”, the optical 
paths of these waves: A=S’—S”. 

Referring to Fig. 146, let us calculate A for the case which interests 
us. It is most convenient to express A in terms of the refraction angle 
r, the plate thickness d and the index of refraction n. As seen from 
the figure, 

A= 2dncosr. 


However, in addition, it is necessary to take into account the phase 
jump occurring upon reflection (cf. pp. 332-33). In this respect, the first 
and second rays differ, for the first is reflected from the external 
surface of the plate, while the second is reflected from the internal 
surface. Therefore, the electric vector of one of the rays undergoes 
a 180° phase jump and the other does not. Thus, the resultant phase 
difference is 


Qn 
6 =— 2dncosr+n. 
Ao 
Maximum interference occurs when 6 =m2n, where m is a whole 
number; minimum interference occurs when 6 =ma. Therefore, 
: oe A 
maximum condition: 2dncosr= md, -+~>° ; 
minimum condition: 2dncosr=ma,. 


Thus, depending on the value of 4, n, d and +r, interference may 
cause the intensity of a wave reflected from a plate to be zero or 
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a maximum. In an ideal experiment with a monochromatic beam, 
by varying the angleof incidence, for example, areflected ray should 
alternately vanish and reappear. In an analogous experiment with 
a beam of white light, the plates should pass through all the colours 
of the rainbow in succession. 


134. Fringes Representing Equal Thickness and Fringes Representing 
Equal Inclination 


Several factors enter into the extremum condition 2dn cos r= mh. 
Hence, if they are varied simultaneously, a confused picture may 
result. The effect is clearest when all the parameters, except one, may 
be considered fixed. 

If a plate has a variable thickness d, a constant refractive index and 
a practically constant angle of incidence (and hence angle of refrac- 
tion) for the portion of the plate under consideration, the interfer- 
ence will be observed in the form of fringes representing equal 
thickness. All parts of the plate having the same thickness d will be 
subject to the same conditions. Therefore, on an uneven plate, there 
appears a system of bright and dark fringes (or rainbowed in case of 
white light). These lines connect points where the thickness of the 
plate is the same.This explains the coloured fringes often seen on 
oily films spread on the surface of water. If a plate is wedge-shaped 
the fringes representing equal thickness consist of straight lines. 
Such fringes may be easily observed on soap films. In the case of a 
vertical film, the soap trickles down and the film becomes thinner in 
the upper region; horizontal fringes appear on the film. 

When light impinges normally on a plate, cos r=! and fringes 
appear on the plate where the thickness d satisfies the relation 2dn = 
=mn,. 

The difference in the thicknesses of the plate represented by adja- 


cent fringes equals fet i.e., a half-wavelength. Thus, bright 
fringes representing equal thickness indicate nonuniformities in 
plate thickness of the order of a tenth of a micron. 

If the thickness from one point to another varies very slowly, the 
fringes may turn out to be very far apart. Thus, for example, in a 
dripping soap film a wedge may form having a 0.5-minute angle; 
in this case, as can be easily calculated with the aid of Fig. 147, the 
fringes will be 2 mm apart. 

If a wedge decreases to zero thickness, the end of the wedge appears 


dark in the reflected light since thicknesses less than 7 do not reflect 


‘light. The first bright fringe occurs for the thickness pat (the path 
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difference is twice as great since the return path must also be included 
in the calculation). The next bright fringe occurs for d=A, etc. Thus, 
the thickness may be determined by simply counting the fringes. 
The question naturally arises: Why are fringes representing equal 
thickness easily observed on thin films but not, for example, on a 
windowpane? The answer is that 
it is not possible to create the 
ideal conditions whereby the 
single variable quantity is the 2mm 
plate thickness d. 
Let us consider the effect of 
a spread in the angle of incidence 
(refraction). If the angles vary a 
from r, to rz in such a manner 2 
that on the interference maxi- 
mum for r, there is superimposed Fig. 147 
the extinction for r,, the inter- 
ference fringes will be smeared. What is the value of the angular 
interval Ar=r,—r, that smears the pattern of fringes representing 
equal thickness? The value can be determined from the conditions: 


2dncosr,=maA and 2dncosr, = (m — x) A, 


whence 


2dn (cos r,-— COS r,) = 4 . 


For simplicity, let us restrict ourselves to the case of normal in- 
cidence; let r, equal zero and r, a small value Ar. Then, 


Ar)? 1 
a Qin=md and 2Qdn (1 AE =(m —3) he 
2dn — = ah 
and since 2dn=ma, 
(Ar? = ; 


m 

If the plate is thin, the values of m are measured in units or tens of 
units. In this case, an angular spread of about a tenth of a radian, 
i.e., 5°-10°, does not smear the pattern. However, for a plate 
1 mm thick, m is already of the order of 5,000. Here, an angular 
spread of the order of only a hundredth of a radian suffices to prevent 
observation of fringes representing equal thickness. 

However, even if the geometry is ideal, relatively thick plates do 
not yield an interference pattern. This is due to the fact that real 
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light is not ideally monochromatic. An ideal waveis an infinite sinus- 
oid. But an atom radiates for a very short period of time. Experi- 
ments show that a real wave is a sinusoidal wave train whose length 
is of the order of ten to a hundred millimetres. In order for interfer- 
ence to occur, the path difference must not exceed the length of the 
sinusoidal wave train; otherwise, coherence does not exist between 
the superimposed waves. If the length of the sinusoidal wave train 





is equal to 100 mm, the path difference corresponds to a value of m 
of the order of 150,000. 

Now, let us consider fringes of another kind, namely, fringes 
representing equal inclination. Such fringes may be observed when 
a beam of light with a continuous spectrum of incident angles im- 
pinges on a plate having parallel surfaces, i.e., a plate for which 
d is the same at all points (Fig. 148). 

Consider a beam of reflected rays contained in a given solid angle. 
Let us direct our attention to those rays lying along generating lines 
of a cone whose axis is normal to the plate. All rays lying on such a 
cone have the same value of r and yield lines representing equal 
inclination. 

The differences in the method of observation of lines representing 
equal thickness and lines representing equal inclination should be 
noted. Since lines representing equal inclination occur at infinity, 
a lens must be placed in the path of the rays to make them visible. 
This enables us to observe curves representing equal inclination in 
the focal plane of the lens. For normal incidence, lines representing 
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equal thickness may be observed with the naked eye on the surface 
of a wedge. If the light impinges on such a plate at an angle, lines 
representing equal thickness may be observed on the surface of a 
wedge only in the case of very thin films. Otherwise, the interference 
pattern is observed in two planes located above and below the wedge 


at a distance qint wherea. is the wedge angle. Toderivethis formula— 


which we leave to the reader—plot a ray incident on the surface of 
the wedge at an angle i and the rays reflected from the upper and 
lower surfaces. The observation plane of the interference pattern will 
pass through the point where the prolongations of the two reflected 
trays intersect. Note that the above formula is only valid for the 
case of an air wedge. 


135. Practical Applications of Interference 


Interference methods are widely used for the measurement of small 
distances and small changes in distances. They enable us to detect 
thickness changes of less than one-hundredth the wavelength of 
light. An accuracy of 10-”? cm may be achieved in the measurement of 
the unevenness of a crystal surface by interference methods. 

Many applications are based on the use of curves representing equal 
thickness. This method is widely used in the optical industry. 
For example, in order to check the quality of the surface of a glass 
plate, an air wedge is created between the plate under test and a stand- 
ard plate having an ideal flat surface and the fringes representing 
equal thickness are examined. The air wedge is formed by pressing 
the two plates together along one edge. If both surfaces are flat, the 
lines representing equal thickness will be parallel straight lines. 

Let us assume that the surface of the plate under test has a depres- 
sion or a bump. The lines representing equal thickness will then be 
distorted, i.e., they will bend around the defective area. If the 
angle of incidence of the light is varied, the fringes are displaced in 
one direction or the other, depending on whether the defect is a depres- 
sion or a bump. The patterns seen under a microscope in these cases 
are shown in Fig. 149. The first two pictures are those of defective 
samples. In the first the defect is located on the far right, while in 
the second the defect is on the left. The third picture is that of a 
sample without defects. 

This method may also be used for very accurate measurement of 
the coefficient of expansion. For this purpose, an air gap must be 
created between the surface of the object under test and a perfectly 
flat surface. As the object expands, the thickness of the air layer 
changes and the fringes representing equal thickness begin to move. 
If a line is displaced to such an extent that the next one takes its 
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place, the thickness of the air layer at this location has changed 
by ss P 

If, as is usually the case, the measurement is performed using 
monochromatic light, the fringes are very sharply delineated and the 





Fig. 149 


displacement of a line by one-hundredth of the distance between lines 
may be measured. 

Accurate measurement of the refractive index of a substance may 
be performed by means of an interference refractometer. In such an 
instrument, the interference 
between two light rays that 
are separated as much as 
possible is observed (Fig. 
150). For this purpose, a 
thick plate is used and a 
convenient angle of inci- 
dence selected (for ordinary 
glass, the best angle is 
about 50°). The rays travel- 
ling between the plates are 
separated and the substance 
being tested is placed in 
the path of one of them. 
This changes the optical 

Fig. 150 path of this ray and hence 

the path difference between 

the interfering rays. If the interferometer plates are exactly alike 

and perfectly parallel, the interfering rays cover the same distance 

and reinforce each other. If the plates are inclined with respect to 

each other, a path difference is created and the clarity of the field 
being observed is reduced. 
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Such is the situation for a beam of perfectly parallel rays. But if 
a slightly divergent beam impinges on the plate, a system of fringes 
representing equal inclination is seen through the eyepiece. In this 
case, the variations in the optical path difference are conveniently 
determined by counting the interference fringes passing the cross 
hair of the instrument. 

Let us assume that a body of length / and‘ refractive index n is 
placed in the path of one of the rays. If the refractive index of the 
medium is n,, the optical path difference changes by A=d (n—n,). 


Therefore, A fringes should pass through the eyepiece of the instru- 


ment. The accuracy of this method may be easily gauged from the 
fact that a displacement of 
one-tenth of the distance 
between lines is easily de- 
tected. For such a _ dis- 
placement, A—0.1A—0.5 x 
x107* cm, which for a 
length 7=10 cm enables us 
to distinguish a change of 
0.5 10°* in the refractive 
index. 

The well-known Michel- 
son interferometer (Fig. 
151) is used for the accu- 
rate measurement of length Fig. 151 
as well as for the determi- 
nation of the velocity of light (see p. 414). In this instrument, a 
parallel beam of monochromatic light impinges on a glass plate 
having parallel surfaces, one of which is covered with a translucent 
layer of silver. This plate is placed at a 45° angle to the incident 
beam. As a result, the beam is divided into two parts. One part 
moves parallel to the prolongation of the incident beam and the 
other is directed perpendicular to the incident beam (to the left). 
These rays impinge normally on two mirrors and return to the same 
points on the translucent plate from which they came. Each ray 
returning from the mirror is repeatedly divided at the plate. Part 
of the light returns to the source and the other part enters the tele. 
scope to the right. As a result, two coherent interfering rays appear 
in the field of the telescope. It is seen from the figure that after 
the first division at the lightly silvered surface the ray coming from 
the mirror opposite the telescope passes through the half-silvered 
plate twice. Therefore, in order to provide equal optical paths, the ray 
coming from -the other mirror is passed through an equalising plate 
identical to the first plate, but without the translucent layer oi silver, 
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magnified 300 times, illustrates the possibilities of the interfer- 
ence microscope. 

The drawbacks of this method include considerable loss of light in 
the interference mechanism and complexity of the microscope’s 
optical system. ‘ 

A further increase in the sensitivity of interference methods and, 
therefore, a further advance in the field of interference microscopy is 
possible by going over from a double-ray interferometer to a multi- 
ray interferometer. In 
double-ray interferometers, 
the screen illumination is 
proportional to 1+ cos kA, 
where ft is the displacement 
along the screen. Due to 
the smoothness of the tran- 
sition from maximum to 
minimum illumination, a 
small displacement of the 
interference fringes is dif- 
ficult to detect. In multi-ray interferometers, the situation is cone 
siderably improved. As an example, let us consider the Fabry and 
Perot interferometer. 

A Fabry and Perot interferometer (Fig. 153) consists of two rather 
thick glass or quartz plates partially silvered on their adjacent 
sides. These surfaces are perfectly parallel to each other and the 
region between the plates contains air. When a beam of light enters 
the interferometer, the beam is divided into a transmitted and re- 
flected component every time it impinges on one of the layers of silver. 
Thus, in the transmitted as well as in the reflected light, there is 
obtained a set of coherent light beams whose intensities decrease 
in geometric progression (see p. 350) and whose phases are displaced 
in arithmetic progression. In order to secure interference of a large 
number of rays, the decrease in amplitude during successive reflec- 
tions must not be large. This condition is met when the silver coat- 
ing on the plates has a reflection coefficient of 0.9 or more. The 
intensities of the rays in the transmitted light are then quite low, but 
there will be little variation from ray to ray. This makes it possible 
for a large number of rays (as many as 10-15) to take part in the 
formation of each illumination maximum. 2 

The interference pattern is obtained in the form of the usual 
rings representing equal inclination, but with one very important 
difference, namely, the principal maxima determined by the condi- 
tion 2 dn cosa =mA are now much narrower and their intensities are 
tens of times greater than the intensity of the background between 
them. Therefore, the interference pattern assumes the form of very 
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narrow bright fringes separated by broad dark intervals. The displace- 
ment of such a narrow maximum may be determined much more 
accurately than the displacement of a fringe in a double-ray inter- 
ferometer. An analogous narrowing of maxima occurs when the num- 
ber of slits in a diffraction grating is increased. 

Thus, multi-ray instruments sharply increase the sensitivity of 
interference methods. Such systems are indispensable in the investi- 
gation of the vertical structure of an object surface in reflected light. 
The magnification of details in the vertical direction may reach a 
value of 400,000, which makes it possible to reliably resolve details 
of the order of 5-10A. This is only several times greater than the 
distance between atoms! An example of such photography is the 
picture of the spiral growth of a crystal shown on p. 635. 


CHAPTER XXI 
SCATTERING 


136. Secondary Radiation 


Under the action of an electromagnetic wave, every molecule 
becomes a secondary radiator of electromagnetic waves. Due to the 
electric force, the electron cloud is displaced relative to the atomic 
nuclei and the molecule acquires a dipole moment varying in time as 
the frequency of the incident wave. The behaviour of such a mole- 
cule differs in no way from the behaviour of the elementary dipole 
discussed in Chapter XX. The intensity of the secondary wave is 
given by the formula derived on p. 323 { intensity ~ pF sin*4) and 
the spatial intensity distribution of the secondary radiation is shown 
in Fig. 133. 

In a number of cases, which will be discussed below, the phenome- 
non of secondary radiation leads to various phenomena of electro- 
magnetic wave scattering. By scattering, incidentally, we generally 
mean any electromagnetic wave propagation phenomenon that is 
not included under refraction, reflection and rectilinear propagation. 

The intensity formula given above is valid for any electromagnet- 
ic wave, However, the fact that the intensity increases sharply with 
radiation frequency explains why the effects of wave scattering by a 
molecule are not detectable when the wavelengths are very long. 
The scattering intensity of visible light is quite sufficient to produce 
significant effects. 

Light wavelengths are hundreds and thousands of times greater 
than the dimensions of ordinary molecules. Therefore, all the elec- 
trons of a molecule are made to vibrate in the same phase by the 
external field. For light waves, ultraviolet rays and even very soft 
X-rays (i.e., X-rays of long wavelength), a molecule behaves like 
an elementary electric dipole. 

The picture changes considerably in the case of X-rays having a 
wavelength of the order of 1A. Now, the dimensions of the molecule 
are larger than a wavelength and different portions of the molecule’s 
electron cloud vibrate in different phases. In order to determine the 
intensity of the scattered wave, we must take into account interfer- 
ence effects occurring between waves scattered by different parts of 
the molecule. 

In principle, this is not very difficult. First, it is necessary to di- 
vide the molecule’s electron cloud into small volumes. The dimensions 
of each such volume, Av,, must be much less than a wavelength. 
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Then, the electrons in this volume will scatter in the same phase. 
Designating the density of the electron cloud by ©, we obtain oeAuv, 
electrons in a volume Av,. The amplitude of the secondary wave 
created by the k-th volume is proportional to @Av,. The ampli- 
tudes of the scattered waves are added with due regard to the phase 
.differences between the elementary waves and this sum is then 
squared. It is found that the intensity distribution of the scattered 
wave differs significantly from the radiation pattern of a single 
dipole. This is understandable, of course, for there will be directions 
in which elementary waves scattered by different volume elements 
reinforce each other, i.e., act in phase, and, on the other hand, 
directions in which elementary waves tend to annul each other. An 
important conclusion to be drawn from such calculations is the fol- 
lowing:. When interference occurs between elementary waves emanat- 
ing from different volume elements of a particle, waves travelling 
backwards tend to annul each other in the final analysis; on the other 
hand, those travelling forward reinforce each other. 

We have been discussing secondary radiation from a molecule, 
but often the secondary radiator is a much bigger particle; namely, 
one composed of numerous molecules. It may be a‘particle of dust, 
colloidal substance, fog, crystalline substance, smoke, or a large 
albuminous molecule, etc. The nature of wave scattering by particles 
is determined by. the ratio of their dimensions to the wavelengths 
of the exciting electromagnetic wave. If the particle is small relative 
to the wavelength, the wave is scattered as by a single elementary 
dipole. If this is not the case, interference effects occur and the for- 
ward scattering predominates. 

_ Different parts of a particle may possess different scattering pow- 
ers. This is precisely the situation in the case of X-rays scattered by 
a molecule. A particle whose scattering power is the same through- 
out the volume is thesimplest body from the standpoint of scattering 
investigations. We shall confine our attention to such a system, 
for not only are the calculations simple in this case, but, in addition, 

a system of this kind is easily simulated experimentally by an = 

ture in an a screen. } 


137. Wave Diffraction at Apertures 


The amplitude of a wave scattered by a particle is determined be 
the distribution of scattering material in the particle. Particles 
(“apertures”) may be encountered in which the density of the scatter- 
ing material gradually decreases with increasing distance from the 
centre of the atom. On the other hand, more pronounced nonuni- 
formities may be encountered, e.g., inclusions and — at whose 
edges. the “ensity changes. abruptly. | 
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To make these patterns more understandable, let us determine 
the intensity distribution of scattered radiation* for the simple case 
of an aperture having the shape of a slit. 

Let a wave impinge normally on a slit cut in an opaque screen. 
Considering the slit divided into volume elements AV as shown in 

Fig. 155, let us write the expres- 

x AK sion for the wave emanating from 

an arbitrary volume element AV, 

at an angle @ to the incident 

wave. The waves from different 

volume elements AV, arrive at 

the point of observation in differ- 

ent phases. If the path difference 

is measured relative to the outer- 

most ray (on the side of deflec- 

tion), the rays emanating from the 

Fig. 155 other volume elements will cover 

a distance that is greater by the 

amount x sin g. Hence, these rays will be displaced in phase by 
21 
ny 





x, Sin @. 

The amplitude of the wave scattered by the k-th volume element 
is proportional to the “scattering” volume element AV, i.e., to the 
expression 

AV, cos (ot = Me sing) ‘ 
It is necessary to take the summation of the expressions for all 
volume elements. Instead of taking the summation we may integrate 
with respect to x, where the x-axis is taken along the width of the 
slit. Replacing AV, by Ax,, which is proportional to it, and going 
over to the limit, we obtain for the amplitude of the scattered wave 
at the angle gq: 


a 
A=k \ cos (ot —~ 3%. xsing ) dx, 
0 


where & is a coefficient of proportionality and a is the width of the 
slit. 

Introducing the variable 
2n 


z= wot — 7, 


xsing, 


* «Scattered radiation” and “diffracted radiation” have exactly the same 
physical meaning. The terms “diffraction” and “diffracted” are generally used 
when the scattering pattern has rather pronounced maxima and minima. When 
the nature of the interference pattern is not so evident, we speak of “scattering”, 
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we obtain 


dz=— * sing dx 








and, therefore, 
k ear . Qna . 
A= = [sin of —sin (ot — 7 sing) ]. 


TT sin @ &: 


Designating = sing by uw and performing a_ trigonometric 
transformation, we obtain 
L 
A= sin wu cos (w@t—u). 


Thus, the resulting oscillation at the point of observation has the 
: ka. 7 r . F 
amplitude -, Sinu, i.e., the observed intensity is 





o 2 situ 
Il=k’a rae 


This is the formula for the intensity distribution as a function of 
the scattering angle. 

In most diffraction experiments, we are interested in small values 
of the scattering angle @. The reasons for this will become clear 
later. Therefore, replacing sin @ by tan @ and since 


x 
tang = ; 
where x is the distance of the point of observation in the plane of 
the photographic plate to the centre of the diffraction pattern and f 
is the distance from the 
slit to the plate, we I 
obtain for uw the expres- 
sion 
ma Xx 


“=> > 


a f 


Fig. 156 shows the 234 


curve. Since u is propor- 
tional to x, this corre- 
sponds to the diffraction a 
pattern appearing on the , Fig. 156 
plate. 

The locations of the dark fringes are easily determined from the 
condition “== +n, where n is a whole number. Thus, the first zero 
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occurs at 
Mf 
t=, 
é a 
This value of x also represents the distance between two successive 
positions of zero intensity. 

From this formula, we can determine when diffraction phenomena 
may be detected for various wavelengths under various conditions. 
The diffraction of light (A=0.5 x 10-* cm) may be clearly observed 
under laboratory conditions if the aperture is of the order of 0.1 cm 
and the distance between the screen and plate is of the order of 
2 metres. Since for these values x=—1 mm, the effect is clearly 
visible. 

Rays of visible light yield noticeable diffraction from a tennis 
ball (a—5 cm), but at a greater distance. If the distance f is equal 
to 100 metres and the wavelength 4 is 5,000 A, then x =1 mm. Thus, 
in this case too the distance between positions of zero intensity of the 
scattered radiation is of the order of magnitude of 1 mm. 

Diffraction of radio waves may also be observed if a proper choice 


of conditions, as determined by the equation cat, is made. 


Assume that the values of f and A are fixed. The width of the slit 
greatly affects the diffraction pattern. If the width of the slit is large, 
x 0, i.e., the slit image focussed by the lens is infinitely narrow. 
As the width of the slit is decreased, the diffraction pattern begins 
to take shape and the first diffraction minimum begins to move further 
and further away from the centre of the pattern. Finally, when the 
slit is made so narrow that our approximation in the formula for u 
(the substitution of sin @ by tan g) is no longer justified, the image of 
the slit on the screen becomes smeared. A still further decrease in 
width, to the point where the wavelength and the width of the slit 
become equal, results in the slit yielding secondary radiation as a 
single source. The interference of primary waves disappears and the 
primary wave is radiated from the slit in all directions. 

For apertures and particles (or inclusions) having other shapes, the 
diffraction patterns appear entirely different (see Fig. 154). Never- 
theless, the general laws remain valid and the basic features of the 
pattern are maintained. Thus, for example, in the case of diffraction 
from a circular aperture or other circular nonuniformity, concentric 
rings are formed and the diameter of the smallest dark ring is 
1.22 4 , where D is the diameter of the aperture. 

Since diffraction patterns have maxima at different locations for 
different wavelengths, white light is resolved into its spectrum upon 


diffraction. Therefore, diffraction from a circular particle or aperture 
has the form of a rainbowed ring. 
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138. A System of Randomly Distributed Scatterers 


We have considered the behaviour of various secondary radiators 
of electromagnetic waves as a function of the ratio of their dimensions 
to the wavelength of the incident wave. But the properties of a radia- 
tor are only roughly determined by its dimensions. The detailed pat- 
tern is determined by the distribution of matter in the scattering 
particle. Only when the dimensions of a particle are small relative 
to the wavelength is the distribution of matter in the particle of no 
consequence. In this case, the particle scatters as a whole, i.e., as 
a single electric dipole. When this condition is not satisfied, the 
pattern is complex since it is determined by the interference of waves 
scattered from various volume elements of the particle. We have 
considered only one example of a scatterer whose dimensions are 
Jarger than a wavelength, namely, a homogeneous scattering particle, 
which may take the form of an aperture in an opaque screen. 

Now, let us consider the problem of scattering by systems of 
particles—e.g., a system of gas molecules, specks of dust or smoke 
particles; asystem of hoar-frost crystals on a windowpane; or a system 
of holes in a piece of gauze. In all such cases, the pattern becomes 
complex due to the fact that the electromagnetic waves emanating 
from the various scatterers may, generally speaking, interfere with 
each other. Now, the scattering pattern will depend not only on the 
properties of a scattering particle, but on the arrangement of the 
particles. Thus, it is important to know how close the scatterers are 
to one another and whether their spacing is regular or random. 
Then, depending on the circumstances, the waves scattered by the 
various particles may interfere to a maximum extent, partially or 
not at all. 

Confining ourselves to the extreme cases, let us first consider scat- 
tering by a system of randomly distributed particles—e.g., the scat- 
tering of X-rays by a large cluster of atoms or molecules having a 
random distribution. 

In the case of a large number of identical scattering centres (e.g., 
atoms, molecules or larger identical particles), the resulting scat- 
tering is determined, as we have already stated, by the scattering of 
a single centre (region) and the arrangement of the scattering centres. 
The scattering pattern in the case of uniform distribution of scat- 
tering centres is quite different from the pattern in the case of random 
distribution. 

If the scattering centres are randomly distributed as, for example, 
in the case of gas molecules, the waves scattered by different centres 
may be considered to be noncoherent. This is because in the case of 
randomly. distributed scattering centres the phase relationship be- 
tween waves emanating from different centres is quite arbitrary. 
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It is safe to say that the number of waves with positive amplitudes 
arriving at a point of observation (from different centres) will be 
exactly equal to the number of waves with negative amplitudes 
arriving there. The result is clear. Designating the amplitudes of the 
waves from the various centres by A,, Ag, Az, etc., the total ampli- 
tude at the point of observation is 


Aa A sed eA ae 


Since the intensity is proportional to the amplitude squared, we 
obtain. 


I~A?= Ai+ AL+ A;+...+2A,A, +2A4,A,+...+2A,A,+.... 


But among the double products there will be just as many positive 
terms as negative terms. Therefore, with a high degree of accuracy, 
the total is given by the sum of the amplitude-squared terms. In 
other words, the total intensity scattered by identical randomly 
distributed centres is expressed as follows: 


I=NA’, 


where N is the number of scattering centres and A is the scattering 
amplitude of one centre. 

Thus, it turns out that scattering by a large number of randomly 

distributed particles is very similar to scattering by a single particle. 

The only difference is that 

the effect is N times greater. 


Data on scattering by a 
single molecule are obtained 
by investigating the scattering 
of X-rays by a gas. Fig. 157 
illustrates a laboratory set-up 
for the study of X-ray scatter- 
ing by gases. A beam of X-rays 
is made monochromatic by 
reflecting it from a _ crystal, 
whereupon it is directed into a 
gas chamber. Scattering is de- 
termined by its action on a 
photographic film, i.e., the de- 


> gree of blackening is a measure 

> of the scattering intensity. We 

Fig. 157 can thus obtain the intensity as 
1g. a function of scattering angle— 


rapidly decreasing smooth 
curves for monatomic gases and decreasing curves having small maxima for 
polyatomic gases. By means of these curves and theoretical formulas, the 
electron-density distribution in a molecule may be determined. F 
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There are many examples of systems that scatter electromagnetic 
waves as a gas scatters X-rays. 

The scattering of light rays in aroom with dust is a familiar exam- 
ple. Through a chink in a window curtain, a narrow, rectilinear beam 
of light visible from all sides penetrates a room. Light waves act on a 
system of dust particles in much the same way as X-rays act on a 
system of molecules. The distances between specks of dust are quite 
large and the distribution of the particles is completely random. 
Hence, there is no interference between the waves scattered by differ- 
ent dust particles, and the scattering pattern is similar to that 
created by a single speck of dust. The sole difference lies in greater 
intensity, i.e., the intensity is proportional to the number of dust 
particles in the field of the primary beam of light. Each speck of dust 
behaves as an elementary electric dipole, for the dimensions of such 
a particle are less than the wavelength of light. Therefore, the laws 
applicable to the scattering of light by dust particles, i.e., the depend- 
ence on the wavelength of light and the nature of angular distribu- 
tion, are the same as for an elementary electric dipole (i.e., the in- 
tensity formula given on p. 323 and the intensity distribution shown 
in Fig. 133 are applicable here too). 

This agreement can also be easily demonstrated by comparing the 
diffraction pattern from a single aperture with that from a system of 
randomly distributed apertures. Experiments show that, as regards 
the relative intensity distribution of scattered light, these two 
diffraction patterns are entirely alike. Of course, in the case of N 
apertures, the intensity of the light scattered by the screen is N times 
greater than the intensity of the light scattered by an opaque screen 
having one aperture. 

- Since the scattering pattern due to a large number of randomly 
distributed centres has the same form as that due to a single centre, 
it becomes clear why a lantern observed through a window covered 
with hoar-frost has the familiar rainbowed halo. This pattern is 
simply the result of diffraction from the ice particles. Since their 
distribution is perfectly random, they behave as “circular” particles. 


139. Behaviour of a Perfectly Homogeneous Medium 


In this article we shall consider the other extreme case of the 
phenomenon of scattering. By a perfectly homogeneous medium we 
mean a system of scatterers that are distributed uniformly and 
continuously throughout a given region. Transparent glass consti- 
tutes such a medium with respect to light waves. Since the wavelength 
of light is:considerably greater than: the distance between the atoms 
in.glass, a piece of transparent glass may be considered to be divided 
into volume elements that are considerably smaller than the wave- 
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length, but at the same time still containing a large number of mole- 
cules. The medium may be considered homogeneous if the number 
of molecules in all such volume elements are approximately the 
same. 

The scattering of electromagnetic waves is not only a function of 
uniformity in regard to the number of radiators per unit volume but 
is also a function of uniformity in the orientation distribution of 
the radiators. In the final analysis, the scattering power of a body 
is determined by its dipole moment, which is composed of the indi- 
vidual dipole moments of the molecules contained in this volume. 
It may be stated, therefore, that the scattering power of a body is 
determined by the value of the dielectric constant or, since ¢e—n’, 
by its refractive index. Hence, for a given wavelength, a medium 
that is uniform as regards the scattering of electromagnetic waves 
must also be uniform as regards the refractive index. 

Observations on the behaviour of electromagnetic waves in perfectly 
homogeneous media show that no scattering takes place in such 
media. Thus, when a ray of light passes through a transparent body, 

_the ray cannot be seen from the side (compare this with the case of 
a light ray entering a room with dust). 

Each volume element of a homogeneous body is a wavelet source. 
Nevertheless, wave scattering does not occur. Only one explanation 
is possible, namely, the wavelets scattered by a homogeneous medium 
in any direction at an angle to the primary ray are completely an- 
nulled due to interference. This theorem is capable of rigorous proof 
but the proof will be dispensed with since it is quite evident that 
this is the only possible explanation. 

Nevertheless, the phenomenon of scattering does make itself felt 
quite considerably in a homogeneous medium. Scattered waves annul 
each other in all directions except one, namely, the direction in 
which the primary wave is propagated. Forward scattering does not 
represent simply a superposition on the primary wave but a change 
in its velocity as well. It turns out that the phenomenon of electro- 
magnetic wave refraction, which we have already discussed, may be 
interpreted as a natural consequence of scattering. 

An electromagnetic wave propagated in a medium may be represent- 
ed as the sum of the primary wave and the scattered waves. Theo- 
retical calculations show that the superposition of these waves leads 
to the retardation of the primary wave. 


140. Scattering in a Nonhomogeneous Medium 


A substance uniformly distributed, in the sense that we have 
just discussed, does not scatter electromagnetic waves. Although all 
portions of this substance create wavelets, secondary radiation is 


See. 140] Scattering in a Nonhomogeneous Medium 371 
Sea a eA Es Oy Se ee 


not observed from the sides: No matter what point in the region we 
chose as our point of observation, it can be rigorously proved that 
the waves scattered by a uniform substance annul each other due to 
interference. Since for any particular wavelet there always corre- 
sponds another that is exactly opposite in phase, the net effect is com- 
plete cancellation. 

Now, let us assume that in some limited region the substance has 
a greater density than in the surrounding medium, i.e., that it has 
an excess of dipoles per unit volume. Then, all wavelets are annulled 
except those created by this excess density. As always, the scattered 
radiation is determined by taking the sum of the wavelet amplitudes, 
whereby the phase differences between the wavelets arriving at the 
point of observation must, of course, be taken into account. 

Is the situation any different if the density of the scattering region 
is less, instead of greater, than the surrounding medium? Scattering 
will cease if matter is added to such ascattering region until the me- 
dium becomes homogeneous. Clearly, the net effect is the same if we 
add a certain amount to a given quantity or subtract this same amount. 
Hence, the scattering from a region of lower density is equal to the 
scattering from the missing substance, i.e., the substance required to 
make the medium homogeneous. 

Thus, it is only important that the scattering region have a density 
distribution of matter that differs from the surrounding medium. 
Moreover, as regards scattering, the effect of a positive density devia- 
tion is indistinguishable from a negative density deviation of the 
same magnitude. For example, the scattering from porous glass is 
the same as the scattering from glass containing randomly distribut- 
ed inclusions of exactly the same size as the pores. 

Due to the large wavelengths of radio waves, scattering will take 
place in this case only when the nonuniformity of the density occurs 
on a relatively large scale. For example, in order for the scattering 
of kilometre waves to be detectable, the extent of the deviations 
from average density that are intercepted by the waves must be at 
least several hundred metres. The waves are unable to “detect” small- 
er inclusions or density gaps. 

The scattering of light waves is detectable when disturbances in 
the distribution of scattering matter are at least of the order of sever- 
al tenths of a micron. Thus, light waves are unable to detect non- 
uniformities in the distribution of electrons in a molecule or in the 
region between two adjacent molecules since these phenomena are 
restricted to regions whose dimensions are much less than several 
tenths of a micron. The situation is different as regards X-rays. In 
this case, the wavelengths are of the same order of magnitude as the 
dimensions of an atom. Hence, an individual atom appears as an 
“inclusion in a void”, 
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The scattering of light waves on nonuniformities is a common 
phenomenon. Nonuniformities in a scattering substance are easily 
recognised by the external appearance of the medium, i. e., by its 
turbid appearance. The conditions needed for light scattering prevail 
in opalescent glass, dust-laden air, etc. In all these cases, there are 
random disturbances of the density of the substance, whereby 
ie dimensions of the disturbed regions approach the wavelength of 
ight. 

As was indicated above, if a particle or nonuniformity scatters as 
a single dipole, in other words, if its dimensions are no greater than 
one-tenth to one-twentieth of the wavelength, the scattering inten- 
sity as given by the dipole formula (see p. 361) is proportional to the 
frequency, and inversely proportional to the wavelength, taken to 
the fourth power. This is the explanation for the following interest- 
ing phenomenon. When white light is scattered by a medium with 
nonuniformities, the medium acquires a blue coloration since the 
intensity of the scattered blue rays, i.e., those of shortest wave- 
length, is considerably greater than that of the others. On the other 
hand, after passing through ascattering medium, white light becomes 
reddish since the blue portion of the spectrum is impoverished due 
to greater scattering. 

For light waves, not only are turbid media nonhomogeneous. A 
homogeneous gas or liquid is optically nonhomogeneous due to the 
presence of density fluctuations. This may be shown by calculations. 
Light waves scattered in a region whose linear dimension is 0.02 mi- 
cron, i.e., one-twentieth of the wavelength 4, may be considered to 
be in phase. There are, on the average, 215 molecules in such a gas 
volume (8 x 10~?**”*) under normal conditions. The relative fluctuation 
in the number of particles according to the laws of statistical phys- 


ics is a , i.e., approximately 4 per cent. This is a perfectly per- 


ceptible nonuniformity as the scattering of light by air shows. 
The blue colour of thesky is due to such scattering. If the atmos- 
phere did not scatter the light of the Sun, the sky would appear black. ° 
It should be noted that the colour of the sky is due to the scattering 
of a relatively small portion of the energy: in a unit volume, about 
10-7 of the primary wave energy is scattered. 
' Scattering from density fluctuations in a substance is referred to 
as molecular scattering since this scattering depends on the molecu- 
lar structure of the substance rather than its impurities. Investiga- 
tion of the molecular scattering of liquids is of interest as a method 
for the determination of certain features of molecular structure. 
As regards the nature of scattering, a nonhomogeneous medium in 
which the regions of deviation from the average density are suffi- 
ciently far apart and quite randomly distributed ‘does not differ from 
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a system of randomly distributed scattering centres (Sec. 138). For 
the most part, however, in discontinuous media (such as fluids and 
amorphous solid bodies in the case of X-rays, opalescent glass and 
colloidal systems in the case of light rays, and the atmosphere in 
the case of radio waves) the interference of waves scattered by neigh- 
bouring regions of lower or higher density affects the form of the scat- 
tering pattern. This type of interference yields a scattering pattern 
that significantly differs from the ideal scattering pattern obtained 
from a single electric dipole. 

- We have considered scattering of electromagnetic waves by a 
system of randomly distributed particles, scattering in a perfectly 
homogeneous medium, and finally, as an intermediate case, scatter- 
ing in a nonhomogeneous medium. One more important case remains 
to be discussed, namely, scattering of electromagnetic waves by 
systems of uniformly distributed centres. Under this case, the follow- 
ing systems will be considered: a diffraction grating for light waves, 
directional radiators for radio waves and crystals for X-rays. 


141. Diffraction Grating 


A diffraction grating may be constructed using a glass plate coated 
with a thin layer of aluminium. By means of a special device, uni- 
formly spaced lines are inscribed on this plate with a soft ivory tool. 
In such a “grating”, thenonuniformities (lines) are uniformly distrib- 
uted, leading to a number of light scattering peculiarities. 

We shall be speaking about an optical diffraction grating, but the 
discussion applies to any regular distribution of nonuniformities and 
scattering centres and to all electromagnetic waves, i.e., from the 
shortest to the longest (kilometre waves). The discussion will be 
restricted to diffraction using parallel rays, which may be realised 
as indicated in Sec. 137. 

If all scattering centres are identical, as is undoubtedly the case 
in an optical diffraction grating, the diffraction pattern may be 
determined in the following manner. 

Consider the amplitude of the wave emerging at an angle @ to the 
‘incident wave. The total amplitude is equal to the sum of the ampli- 
tudes of the waves scattered by the individual centres. If the waves 
from the individual centres arrived at the point of observation in 
phase, the total amplitude would be equal to the product of the 
number of centres N and the amplitude of an individual centre f. 
However, the wave from each centre is displaced in phase relative 
to the wave from an adjacent centre, and it may be assumed that the 
magnitude of the displacement is the same in each case. Waves from 
different centres interfere with each other and the resultant intensity 
is equal-to Lf? rather than Vf*, where L is a quantity greater than NV 
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for the directions in which the waves reinforce each other and less 
than N for the directions in which they arrive, in the main, out of 
phase and annul each other. 

The directions for which the waves from all centres reinforce each 
other may be easily determined with the aid of Fig. 158. The path 
difference between waves emanating from corresponding points of 
adjacent centres is equal to asin q. If this path difference is equal to 
a whole number of wavelengths, the waves reinforce each other: 
asin @=nd (maximum condition). As may be seen, there are several 
such directions. If the wave impinging on the grating is not monochro- 
matic, the grating resolves the wave into its spectrum. Moreover, 
there will be several spectra rather than just a single spectrum. The 





number n appearing in the above equation is called, therefore, the 
order of the spectrum. 

Since it is somewhat cumbersome to calculate the intensity distri- 
bution of ascattered wave, let us merely confine ourselves to one 
important problem, namely, the determination of the width of a 
diffraction maximum. We are interested in determining how rapidly 
the intensity of a diffraction maximum, which occurs at an angle 
@ satisfying the equation a sin p=nd, decreases. Does one maximum 
immediately pass over into the next or is there a large interval 
between maxima? The case of a grating consisting of a large number 
of scattering centres (lines) is of considerable practical importance. 
Imagine the grating divided into two parts. Naw: compare a pair 


of rays coming from the first centre and the ae 1 | st centre, the 
second centre and the (5 TREE 2)\ndcentre. At maximum reinforcement 
of the waves, the path difference between such pairs of rays is 
equal to Wah, If the path of the rays is slightly changed, the rays 


being inclined so that the path difference increases by ae maxi- 
mum reinforcement of the superimposed waves is replaced by total 
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annulment. The first wave cancels the vo 1) st, the second can- 
cels the e+e nd, etc. Exact calculations show that for posi- 
tions further away from the maximum the intensity remains 
practically equal to zero until the angle of inclination @ approaches 
the position of the next maximum. 

The angle at which maximum diffraction of n-th order occurs is 
given by the formula sing=™, 


If we designate the half-width angle of the maximum by Ag, then 
for the angle (p+Aq) we may write the condition 


N 2 N nr 
3 asin (p + Ag) =5 th+ z 
Hence, 
, A 
sin(p + Ag)= 44 
or 


sin (p+Ag)—sing=>. 


The distance between two successive maxima is determined by the 
expression 
sin @, — sin Qa. 


We see that the half-width of a line is, roughly speaking, o of the 


distance between maxima. When N is large, i.e., in the case of grat- 
ings consisting of a large number of scattering centres, the diffrac- 
tion lines are extremely narrow and the resolution in the spectrum 
obtained from the grating is very fine. Imagine, for example, that 
light containing two close waves, 4 and A-+6A, impinges on a 
grating. For simplicity, assume we are concerned with scatter- 
ing at angles less than 20°; hence, sin pq. Then, in the n-th order 
these two lines are displaced by the angle 6g, which, as can be seen 


from the condition oxsing=™, is approximately equal tom 6A. 
The width of the maximum for each wave may be determined from 
the equation 

dp=sin (p+ 5) —sine=~. 


Evidently, these two lines may be distinguished (in optics, we 


say resolved) if 
a Na 
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The expression A= nN is an indicator of the resolving power of 
the grating. 


Example. In a good diffraction grating, the distance a between lines is 
~ 107? mm and the number of lines NV is equal to 100,000. Then, the resolving 
power for the spectrum of second order is = nN = 200,000. This means that 
for 4=6,000A, for example, two lines whose wavelength difference is 0.03A 
may be resolved. 


Let us consider the intensity of a diffracted beam. The waves 
directed towards a maximum point act in phase. If f is the amplitude 
of the wave scattered by one centre, the total amplitude in the maxi- 
mum direction is Nf and the intensity is N?/?. Thus, the height of a 
diffraction maximum is proportional to the number of scattering 
centres squared and, since the width of a maximum is inversely pro- 
portional to N, its area (i.e., the integral of the maximum intensity) 
is proportional to N taken to the first power. If different maxima are 
compared, it will be seen that the ratio of their heights (or, what 
amounts to the same, their areas) depends on the value for these di- 
rections of the amplitude f of the scattering from one centre. 

: Thus,- the period of a grating determines the locations of the maxi- 
ma, while the form (in the broad sense of the word) of a line or scat- 
tering centre determines the intensity of the maxima. . 

Let us assume the angles g,, 9,,,,etc., are determined by the period 
of a grating. Scattered rays occur only at these angles. But what will 
be the intensity of these rays for the first, second, etc., orders of dif- 
fraction? This depends on the amplitude values of one scattering 
centre for these scattering angles. Thus, for example, the amplitude ; 
may be a maximum at the angle @,. Then, the second order diffraction 
will be represented by an intense line. If at the angle g,, the ampli- 
tude f is close to zero, then the third order line will not appear in 
the diffraction spectrum, etc. This is illustrated in’ Fig. 159, which 
shows the diffraction spectra and scattering factors f of one centre 
(dotted curves) for two different grating arrangements. 

These principles form the basis of any study of structure by means 
of diffraction spectra. The distance between diffraction lines enables 
us to determine the period of the grating—assuming, of course, that 
the wavelength is known—and the intensity of lines of different order 
enable us to determine the structure of a scattering centre. 


Example, Consider a diffraction grating for which a=3X 107? mm and 
N=1,000. Assume a parallel monochromatic beam of light (A=5,000A) im- 
pinges on this grating. Diffraction maxima afte visible at the angles given by 
sing, =—=%, and the width of a diffraction maximum is equal to 

: 


2 = TaN The obtained results are valid for any scattering centre form. To 
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calculate the relative intensity of the diffraction maxima, it is necessary to 
consider specific scattering centres. Let us examine two cases: 
1. The scattering centres are single strips of width b=-a=0.75X107? mm 


(Fig. 159a). In Sec. 137, we obtained a formula for the intensity of a wave 
diffracted at a slit. The magnitude of the intensity is proportional to the am- 
plitude squared of the scattering from one strip: f?~ = , Where uae sin 9. 
The intensity of the n-th diffraction maximum is determined by the magnitude 


of f, in the direction @,, which is determined, in turn, from the equation 


sil Pa=—y- If f{ is taken as 100, the other intensities have the following 





values: 


2. The scattering centres are double strips of width 6 = + a=0.75107? mm 
each. The grating period a is the same as before (Fig. 159b). Clearly, the loca- 
tion and width of the diffraction maxima have not changed. Calculations 
similar to those performed in Sec. 137 show that for two slits 





pa tite (1400 [% o-bedsno]} 


From this expression, the relative intensities of the diffraction maxima are 
easily determined. Assuming, as before, that fj= 100, then 


LS=Re fs. aos fa i287, 


142. Directed Radiators of Radio Waves 


In certain radio engineering applications, particularly radar, 
it is required to direct a radio beam into space in such a manner 
that the transmitted energy is concentrated in a very small solid 
angle. One solution of this problem is 
to utilise a linear array of antennas. 

In Sec. 141, we saw that for uniform 
spacing of scattering centres the radiated 
energy is concentrated in specific direc- 
tions. If radiators of radio waves are 
arranged in a single row (see Fig. 160) 
with a distance a between adjacent an- 
tennas, and if all the antennas are fed 
synchronously, such a radiator array 
will in no way differ from a scattering 
diffraction grating. The fact that we are 
dealing here with primary waves in no 
way detracts from the applicability of 
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the discussion in the preceding article 
to the present case. It is only necessary 
that it be permissible to consider the 
radiation from different antennas cohe- 
rent, which is the case if the antennas 
are fed synchronously from a generator 
of oscillations. 

If the antenna array is so dense that 
the distance between adjacent dipoles 
is less than a wavelength, even first 
order diffraction is impossible, as the 
equation a sin @=nd shows. Only the 
zero order remains. This means that 
there are only two radiation maxima— 
one forms a 0° angle with the normal 





Fig. 161 


to the array and the other a 180° angle. The point made in Sec. 141 
about the width of the maximum is valid here too, namely, the 
larger the total number of radiators, the smaller the solid angle in 
which the beam intensity reaches an appreciable value. 

However, in practice, it is inconvenient to have radiation of equal 
intensity in two opposite directions. To avoid this, double arrays 
of dipoles are used (see Fig. 161). The antennas of each dipole pair ate 
separated by a distance of 4/4 and a phase difference of 90° exists 
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between the two currents. As a result, one of the two maxima is 
annulled and all the energy is concentrated into one diffraction 
maximum. The phase relationships existing for each dipole pair in 
such an array may be explained as follows. For the “forward” waves: 
if the waves were propagated synchronously, the path difference 
between them would be 4/4; but the antennas do not operate 
synchronously, i.e., the wave radiated by the “front” dipole lags 
by 90°, which compensates for its-lead due to the path difference. 
The situation is different for waves propagated “backwards”. The 
displacements due to the path difference and the phase difference 
‘ between the currents in the antennas are in the same direction. 
Hence, the total phase difference is 180°. As a result, there is no 
radiation in the backward direction. 

If the antennas are arranged in a row, a narrow beam may be 
obtained in the plane perpendicular to the antennas. But if a small 
beam in space is desired, a more complex system of oscillator 
elements must be used. That is why radar antennas have such strange 
appearances (see Fig. 162). 


CHAPTER XXII 
DIFFRACTION OF X-RAYS BY CRYSTALS * 


143. Crystals as Diffraction Gratings 


As has been already indicated, a diffraction grating usually con- 
sists of a piece of glass on which equally spaced lines are scratched. 
What is essential here to obtain one of the typical diffraction patterns 
discussed above? Is it the presence of glass, the shape of the lines, 
the thickness of the glass, or the width of the “slits”? A careful study 
of Sec. 141 shows that the essential element is the periodic repeti- 
tion of the nonuniformities of the scattering substance. Thus, irre- 
spective of the cause of the scattering and the nature of the nonuni- 
formities, as long as these nonuniformities are repeated with a 
periodicity a, scattering maxima will occur at angles @ satisfying 
the equation a sin g=nd. Such a pattern is given by lines of any 
shape scratched on any glass, or slits of any shape made in any screen. 
It is only necessary that the distribution of matter repeat with a 
periodicity a. 

To be sure, certain differences in patterns may exist. The inten- 
sities of rays diffracted in different orders may differ depending 
on the shape of the slit. The distribution of substance within a 
repeating nonuniformity affects the scattering intensity f?, which 
for different orders may have different values. 

Now that we have reviewed the results of Sec. 141, let us turn 
to crystals. The basic feature of crystals distinguishing them from 
other bodies is the periodic distribution of matter. Along any di- 
rection of a crystal, the time-averaged electron density is periodi- 
cally repeated. In the simplest case, the electron density distribu- 
tion will appear as shown in Fig. 163. This figure shows the electron 
density (the number of electrons per cubic Angstrom) along a line 
parallel to the edge of a cube of rock salt. An electron density max- 
imum corresponds to the centre of an atom. The large maximum 
corresponds to the centre of a chlorine atom and the small maximum 
to that of a sodium atom. The pattern repeats itself after every oth- 
er atom, and the period of the electron density distribution aleng 
the line is equal to 5.6 A. This is the pattern of the distribution 
obtained along a specific line. Along a slightly displaced parallel 
line, the distribution will be different. 


* Before reading Chapters XXII and XXIII, it is recommended that Chap- 
ter XXXII be perused. 
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However, a crystal is a three-dimensional formation, and the 
repeating element is a three-dimensional cell. The electron density 
distribution of a cell cannot be illustrated graphically, but it is 
sufficient to know that the cells repeat in space. The similarities 
and differences between a crystal and a diffraction grating are evi- 
dent. A crystal is a three-dimensional diffraction grating in which 
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the nonhomogeneous element repeats regularly in three-dimensions 
rather than along a line. The role of “slit”, i.e., repeating nonuni- 
formity, is played by the unit cell of the crystal. 

Let us determine the nature of the diffraction pattern created 
by a crystal. 

X-rays are scattered by electrons. The nonuniformities in the 
electron density are of such a nature that wavelengths of the order 
of 1-2A yield perceptible diffraction. In order to determine the 
direction of the diffracted rays, the wavelets coming from all the 
cells must be added. Of course, the amplitudes of these wavelets 
for a given direction are the same. The difficulty arises in taking 
account of the phase differences between wavelets scattered by in- 
dividual cells. These wavelets must be added for every direction 
and the directions in which the wavelets reinforce each other to a 
maximum extent determined. ; 

The problem can be solved in various ways since various summa- 
tion sequences are possible. For example, first wavelets from the 
cells along edge a may be added, then wavelets from all columns 
in the plane ab, etc. But we shall use a much simpler method. This 
is the method proposed by the founders of X-ray structural anal- 
ysis, the two Braggs—father and son, British scientists. (The same 
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idea was proposed independently by the Russian crystallographer 
Vulf.) In a crystal, parallel planes can always be passed through 
the lattice nodes in numerous ways; the crystal is composed of the 
layers between such successive planes. Let us construct a normal 
to these layers and imagine the electron density projected onto the 
direction of the normal. Clearly, a periodic electron density distri- 
bution exists along the 
normal. The period d is 
appropriately called the 
interplanar distance. 
The condition for 
‘maximum reinforcement 
of waves scattered by 
the cells of one layer is 
that the angle of inci- 
dence equals the angle 
of reflection. This con- 
clusion is based on Huy- 
gen’s principle, for only 
when the above condi- Fig. 164 
tion is satisfied are the 
scattered waves propagated in phase, and thus reinforced. Waves 
of successive layers reinforce each other when certain additional 
conditions are met. Fig. 164 shows that the path difference be- 
tween rays “reflected” from the corresponding elements of two adja- 
cent layers is equal to 2d sin 6. Thus, diffracted rays are obtained 
when the condition 





2d sinf =na 
is satisfied. 


A diffracted beam is obtained when a system of planes may be 
found, among the countless such systems into which the crystal may 
be divided, which satisfies the condition 2dsin#=nd. Of course, 
this requirement may simultaneously be met by several systems of 
planes. However, the more likely situation is that diffraction does 
not occur for an arbitrary direction of a monochromatic beam. 
Hence, in order to observe diffraction, the crystal must be turned 
until a suitable angle 0 is found. 


Example. The interplanar distance in a calcite crystal is equal to 3.029A, 
In X-ray structural analysis, radiation from a copper anode is often used. Since 
this radiation has a wavelength of 1.54 A, the diffraction maximum of first order 


ane 
occurs at 9=arcsin ga 14°40". 
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- 144, Determination of the Parameters of a Crystal Cell 


Having experimentally determined the angles 20 formed by 
rays diffracted from a crystal, one can determine, if 4 is known, 
the interplanar distances, and hence the structural period in any 
direction. 

If the lattice is cubic, it is described by one parameter, i.e., 
the edge of a cube. A rhombic lattice is described by three mutually 
perpendicular periods a, 6 and ec. 

Rays may be reflected from any system of planes, including the 
planes parallel to the main lattice faces ab, bc and ac, by appro- 
priate placement of the crystal 
relative to the beam. A.series of 
such measurements suffices in all 
cases to fully “probe” the lattice 
structure, i.e., measure the length 
of the edges of a unit cell and, for 
i the case of low-order symmetry, 
determine the angles between them. 

These measurements are quite 
interesting. Having determined the 
volume V of a crystal cell and the 
density 6 of the substance, the 
mass of substance in a cell may 
be directly calculated from M =V6. 
The molecular weight of a cell is 

Fig. 165 then obtained by dividing this 

quantity by the mass of a hydrogen 

atom: m,,—1.66 x 10° ** gm. Of course, the number of molecules in 
a cell must be an integer. Moreover, in many cases, symmetry 
considerations restrict the possible number of molecules. For exam- 
ple, in a rhombic cell, the number of molecules cannot: be less 
than four. Thus, cell measurements are of great importance in: the 
determination of the molecular weight of a substance. ; 

In the simplest cases, the determination of a unit cell suffices 
to determine the structure. A unit cell of iron is illustrated in Fig. 
165. For such a cell, the distances between atoms are as indicated. 
These data are obtained by simply measuring the interplanar dis- 
tances. The investigator reasons as follows: A crystal of iron is cu- 
bic. Let us measure the fundamental interplanar distance for the 
system of planes parallel to the side of a cube. This yields the value 
1.4. A. Now, let uscalculate the number of iron atoms in a cube whose 
edge is 1.4A long. The mass of an iron atom is mre =92.6 X 107** gm 
and the density of iron is6re=7.88 gm/cm*. Thus, a cube of vol- 
ume (1.4)* x 10~*4 cm® contains a mass of (1.4)* x 1077* x 7.88=21.7 x 
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x 10-** gm. But this mass is one-fourth that of an iron atom. Hence, 
the edge of a unit cell of iron is greater than 1.4 A. Let us assume 
that it is twice this length. Then, the 
edge of a cube is 2.8 A and we ob- 
‘tain 2 atoms per cell. 

Since the crystal is cubic and pos- 
sesses an axis of symmetry of fourth 
order (see p. 584), the second atom can 
be located only at the centre of a unit 
cube. Let us check the validity of the 
assumption of two atoms per cell. If 
the assumption is correct, the diago- 
nal interplanar distance shown in Fig. 
166 should equal 1.4//2A. This is the 
value obtained experimentally. Hence, 
the above model of the structure is 
correct. 

For many metals, alloys and sim- 
ple salts whose formulas are of the 
type AB, such simple reasoning is Fig. 166 
quite often sufficient to determine 
the interatomic spacing. If there are many atoms in a cell and 
the shape of the cell is not cubic, the structure may be deter- 
mined only by utilising beam intensity data in addition to data 
on the geometry of the diffraction pattern. 





145. Intensity of Diffracted Beams 


Analogously to the case of a linear grating, the intensity of a 
beam diffracted by:a crystal is equal to N’F’, i.e., it-is pros 
portional to F*, the square of the amplitude of the wave scat: 
tered by a unit crystal cell in a given direction, and to N’, the 
square of the number of unit cells in the illuminated vol: 
ume. , 

The quantity F’ is uniquely related to the crystal structure, 
i.e., to the nature of the electron density distribution in the 
cell. 

As has been already indicated, the quantity F* for a given dif- 
fracted beam and system of “reflecting” planes depends on the projecé 
tion of the electron density.on the normal. By means of parallel 
planes, let us divide a crystal layer into infinitely thin layers dz. 
If @ is the electron density of a cell, then edz is the number of elec- 
trons in the layer-dz. All the electrons of a thin layer are scattered 
in phase and yield the wave gdzcos (wi-+q). Hence, the wave 
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amplitude of a cell is expressed by 


Q cos (wt + gq) dz. 


f= 


The integral is taken over a single period (the interplanar distance); 
differ, 


the values of @ and @ for each z 





Fig. 167 
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We shall not concern ourselves with the problem of determining 
the electron density of a crystal at all points of a unit cell. In all 
cases the basis of the solution is the above equation. It enables us 
to calculate the amplitude of any diffracted beam if the electron 
density of the crystals is known. Of interest, however, is the converse 
problem—the determination of the electron distribution in a crys- 
tal from the experimentally determined intensities of diffracted 
beams. This problem has been solved for very complex crystals 
containing hundreds of atoms per unit cell. 

In the case of anthracene crystals, the intensity of about 600 
diffracted beams has been measured. Using these data, the values 
of the electron density at all points of such a cell were determined. 
Fig. 167 illustrates the electron density for a cross-section through 
the centres of atoms of an anthracene molecule. (The method adopt- 
ed to indicate the electron density is that used by topographers 
to indicate elevation. Electron density contour lines on the electron 
‘density diagram correspond to elevation contour lines on a topo- 
graphical map.) The fourteen distinct peaks represent fourteen hy- 
drogen atoms. Experiments show that the distance between adjacent 
peaks is 1.4 A. The height of an electron density maximum is pro- 
portional to the number of electrons in the atom. Carbon atoms 
have six electrons each and hydrogen atoms one. Not all of the 
ten peaks corresponding to the centres of the ten hydrogen atoms 
contained in an anthracene molecule are fully sketched in the dia- 
gram. The chemical formula of anthracene is C,,H,,. 


146. Methods of X-Ray Analysis 


The measurement of the angles 2) formed between the difiracted 
beams and the beam incident on a crystal, as well as the intensi- 
ties of the diffracted beams, may be accomplished using an ionisa- 
tion chamber (see p. 528) or a photographic method. A photographic 
film on which the traces of many diffracted beams are simultaneous- 
ly recorded is called a roentgenogram. 

But how can the traces of several beams be obtained on a single 
film when, as has been already indicated, the condition nA =2d sin4 
in all likelihood will not be fulfilled even once for an arbitrary orien- 
tation of the crystal relative to the beam? This can be accom- 
plished in three ways: 

1) by rotating the crystal, thereby setting various systems of 
planes in a reflecting position; 

2) by illuminating the crystal with a continuous spectrum of 
wavelengths in a band sufficiently broad so that almost every system 
of planes finds a “suitable” wavelength in the spectrum; and 
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- 3) by obtaining a roentgenogram of a powder, since in a powder 
the diffraction condition for any d is always fulfilled for some crys- 
tals. 

The first method is known as the crystal rotation method, the 
second as the Laue method, after the German physicist whose name 
is associated with the discovery of beam diffraction, and the third 
as the powder or Debye method, after the scientist who proposed 
this method. The Laue method has extremely limited application. 


nH 
Sy 
~n 
8 : 
BS 
a nx Q 
Primary Ss + 
beam S 
~ 
=) PS 
rt) 3S 
S Ss 
SS Cty 
i= 
oS 
[= ) 





Fig. 168 


In practice, the rotation method is used for the investigation of crys- 
tal structure, i.e., for the determination of arrangement of atoms 
and the Debye method is used for special problems arising in connec- 
tion with the investigation of fine crystalline substances. 

The purpose of a rotating roentgenogram is to gather data on 
one film regarding existing interplanar distances and intensities 
of. the respective beams. However, it is also necessary to know the 
orientation of the system of planes relative to the crystal axes. This 
requires knowledge not only of the location of a particular spot 
on the film, but also of the instantaneous orientation of the crystal 
when it arose. In order for the roentgenogram to provide this infor- 
mation as well, the film is displaced during filming. Such filming 
methods are known as roentgengoniometric methods. 

Fine crystalline substances are used much more often than mon- 
ocrystals for the investigation of crystal structure by .means of 
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X-rays. Fig. 168 illustrates how a roentgenogram is‘ obtained by 
the Debye or powder method. 

Let us direct our attention to a specific system of planes sepa- 
rated from one another by a distance d and having a normal zn. 
Assume that the normals na of the crystals are directed in all direc- 
tions. A beam “reflected” from a plane is not produced by reflec- 
tion from all the crystals, but only from those whose planes are at 
an angle? to the beam, where @ satisfies the condition 2d:sin O==nA. 
Accordingly, the normals of these crystals are at an angle of 90°-0 
to the primary beam. All crystals whose normals lie on the cone hav- 
ing an apex angle 2(90°-0) are in a reflecting position for planes 
separated from one another by the distance d. Therefore, the “re- 
flected” beams also form a cone, and this cone has an apex angle of 
40. Ring patterns are obtained when these cones are intersected by 
a photographic film or plate. 

If we are concerned with values of tnot exceeding 20-25°, 
the roentgenogram may be taken on a flat plate. In this case, we 
obtain a system of concentric rings. However, if information on 
all interplanar distances is desired, and scattering over the entire 
interval of angles possible is to be analysed, cameras with cylindri- 
cal film are used. In this case, the height of the film is reduced, so 
that only parts of the rings are photographed. 

If for some problem or other it is important to ascertain the in- 
terplanar distances more accurately, we usually resort to “rear” 
filming, i.e., an arrangement whereby diffraction cones having 
apex angles close to 360° are filmed. 

In determining the scattering angle 0 from the film, a measure- 
ment error A?’ inevitably arises. Let us see how the magnitude of 
this error is reflected in the determination of the magnitude of the 
interplanar distance. After differentiating the diffraction condition 
2d sin®=nd; we obtain 


[=F] =cot 8a9. 


It is seen that the accuracy in measuring the interplanar distance 
rapidly increases as the angle @ approaches 90°. The diffraction 
angle may be easily measured with an accuracy of the order of 0.1°. 
Hence, the above equation shows that along lines for which the angle 
® is equal to 65°, 75° and 85° the interplanar distance may be meas- 
ured with an accuracy of 0.13%, 0.08% and 0.05%, respectively. 
Using special cameras, the method of rear filming yields very good 
results, enabling interplanar distances to be determined with an 
accuracy of 0.00001 A. For best results, the radiation wavelength 
is selected so that the scattering angle approaches 90°. 
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All three types of Debye filming are widely used in the investi- 
gation of the structure of matter. Every substance yields a specific 
system of lines that is characteristic only of it. A phase transforma- 
tion results in the replacement of one system of lines by another 
and the new phase may also be determined from the roentgenogram. 
Phase analysis is one of the most important applications of these 
films. ; 

Now, let us assume that the crystals of the substance have a 
favoured orientation. In this case, the normals m of one or another 
system of planes no longer assume all directions in space and are 
not uniformly distributed on the cone of normals shown in Fig. 168. 
But if the cone of normals is not complete, the same holds true for 
the cone of diffracted rays. Hence, instead of solid rings, broken 
rings appear on the film. Such rings indicate the presence of a fa- 
voured crystal orientation or structure. Using the Debye method, 
a detailed study may be made of the nature of the preferred crystal 
orientation arising, as a rule, for various kinds of plastic deforma- 
tions. 

The crystal dimensions also affect the form of the picture. If 
the crystals are very large, the roentgenogram ring is not solid, 
i.e., it breaks down into dots, each of which is due to “reflection” 
from an individual crystal. If the crystals are very small (of the 
order of 10-*~10~° em), then, in accordance with the theory, the 
lines begin to become smeared. Based on this knowledge, methods 
of estimating the average dimension of crystals have been devel- 
oped. 

Pot purposes of analysis, another problem arises in a number of 
cases, namely, to construct the apparatus in such a manner that the 
X-ray spectrum radiated by a substance may be investigated. As 
will be seen below (p. 503), every substance can produce a character- 
istic X-ray spectrum. The fact that the spectra of different kinds 
of atoms differ significantly from one another may be utilised for 
the conduction of qualitative and quantitative analyses. For this 
purpose, the face of a large crystal, for which the interplanar dis- 
tance is known, is placed in a “reflecting” position. By rotating 
this crystal and measuring the diffraction intensity for each angle, 
the wavelengths present in the spectrum of the substance under 
investigation, and their intensities, may be determined from the 
values of the angle @. 


CHAPTER XXIII 
DOUBLE REFRACTION 


147. Anisotropic Polarisability 


When a substance is placed in an electric field, the bound charges, 
i.e., the electron clouds of the molecules, are displaced from 
their equilibrium positions and form electric dipoles. In discussing 
the polarisation of a dielectric (p. 250), we spoke of the displacement 
of electric charges along lines of force. But clearly the situation 
may beentirely different in actual molecules. If an atom has a spher- 
ically symmetric electron cloud, the latter is indeed displaced to 
the same extent in any direction. The polarisability of such an atom 
is said to be isotropic. It is evident that even a diatomic molecule 
cannot be isotropic, for the nature of the electron bond parallel and 
perpendicular to the line connecting these two atoms, and there- 
fore the polarisability in each direction, is not the same. The mag- 
nitude of the polarisability, and hence of the displacement (dipole 
moment), will vary depending on the direction of the electric 
“vector relative to the axis of the molecule. 

It is also significant that the displacement direction and the 
electric force direction do not, generally speaking, coincide. This 
may be illustrated by a mechanical model—a bead attached to two 
perpendicular springs of different elasticity. Assume, for example, 
that a force acts at a 45° angle to the bonds. Different forces will 
then act along the bonds. Let us assume that the elasticity of the 
horizontal spring is three times greater than the elasticity of the 
vertical spring. The vertical displacement will then be three times 
greater than that of the horizontal, and the displacement vector 
will form a large angle with the electric force vector. In exactly 
the same manner, the induced dipole moment forms an angle with 
the direction of the field intensity E. 

Almost all molecules possess anisotropic polarisability, but 
this polarisability is by no means manifested in all cases. It does 
not manifest itself in liquids, amorphous bodies and gases. Thus, 
for each molecule whose dipole moment is inclined to the “left”, 
there is a conjugate whose dipole moment is inclined to the “right”s 
The resultant dipole moment of such a pair of molecules, and hence 
the dipole moment per unit volume, i.e., the polarisation vector P, 
is directed along the vector E. 

However, even in the case of crystals, where the arrangement 
of molecules is uniform, the anisotropic polarisability of the mole- 
cules is not necessarily manifested. Cubic crystals possess isotropic 
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polarisability. Here, the equation P=aF remains valid, just as 
for isotropic bodies. To make this clear, let us consider molecules 
whose electrons may be displaced only in a single direction. The 
symmetry of a cubic crystal is such that molecules whose axes of 
polarisability form right angles may always be found in the crystal. 
Let us consider three such molecules whose axes of polarisability 
lie along the x, y, z axes of a Cartesian system of coordinates. When 
an arbitrarily directed field E is applied, these molecules become 
polarised and produce dipoles of moments BE cos a,, BE cos a, and 
BE cos a,, since the moment is proportional to the projection of E 
on the polarisation direction. The re- 
sultant of these three dipole moments 
is obtained by vector addition. But 
G1, G., a, are the angles formed by 
the vector E with the coordinate 
axes. Hence, the magnitude of the 
resultant moment is 
EY 6? (cos* a, + cos? a, --cos*a,) =BE 
and the direction is parallel to £. Tak- 
ing the summation for all the mole- 
cules, we arrive at the same conclu- 
sion as regards the polarisation vec- 
tor P and the polarisability a’ per unit 
volume. 

Fig. 169 Now, let us consider crystals hav- 

ing one main axis, i.e., crystals 

with tetragonal and hexagonal syngony. To be specific, let us 
consider the former, i.e., crystals in which each molecule has three 
identical molecules related to it via an axis of symmetry of the 
fourth order. Assume, moreover—again for purposes of simplicity— 
that the molecule can be polarised only along one axis. Let us di- 
rect our attention to the four molecules whose axes of polarisahility 
form an angle e with the main axis (see Fig. 169). How do these 
molecules behave in electric fields of various directions? If the vec- 
tor E is directed along the main axis, the polarisation is proportion- 
al to cose. Moreover, in view of the symmetry of the arrange- 
ment, the resultant dipole moment of these molecules is parallel 
to E; hence, the polarisation vector is also parallel to E: 


P=ajE, 


where a, is the polarisability created by all the molecules for this 
direction of the field. 

In the projection onto the plane perpendicular to the main axis, 
the polarisability axes form right angles with each other. Therefore, 
the result obtained for a cubic crystal is valid here, namely, the 
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polarisability is the same for all directions of Ein the plane perpen- 
dicular to the main axis. If E is perpendicular to the main axis, 
the polarisation vector P is again parallel to E: 


P=a,E. 


While the polarisability of the molecules along the axis is pro- 
portional to cose, the polarisability of the molecules in the direc- 
tion perpendicular to the axis is proportional to sine. This means 
that a, and a, are different. ; 

What is the situation when the field E is inclined to the main 
axis of the crystal? In view of the difference in the polarisabilities 
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Fig. 170 


a, and a,, the vector P can no longer coincide with the direction 
of the field, and the value of a will also be different. Knowing ay 
and a ;, @ may be calculated for any direction. We shall not explain 
how this is done in the general case, but merely cite a numerical 
example. 

For a crystal of Iceland spar (calcite), a; —0.139 and ay —0.095. 
Assume that the vector E forms a 30° angle with the plane perpen- 
dicular to the main axis and that it is directed as shown in Fig. 170. 
The polarisation vector in the indicated plane is then 


P, =a ,£, =0.139 x Ecos 30° = 0.120 E. 
The polarisation vector perpendicular to this and parallel to the 
main axis is 

Py =ayFy =0.095 x E sin 30° = 0.0475 E. 
Therefore, the angle between the resultant polarisation vector P 
and the plane is arctan pe? 21°40", This means that the angle 
between P.and E is ~8°20’.The magnitude of the polarisation vector is 
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P=) P* +P) =0.129 E, i.e, in this case the polarisability a 
is equal to 0.129. 

If E is directed at a 45° angle to the main axis, the polarisabil- 
ity a decreases even more relative to a, and becomes equal to 0.120. 
The angle between E and P is then 
0.095 


° ; 
45° — arctan 0-139 


~=10°30’. 

The direction of the vector E in Fig. 170 is such that the angles 
formed by this vector with the polarisability axes of the molecules 
differ. Thus, the angles are greater for the pair of molecules on the 
left. As a result, the right pair of molecules is polarised to a greater 
extent. 

The fact that the polarisability of a crystal possesses an axis 
of symmetry of the fourth order does not signify symmetry in the 
contributions of individual molecules to the magnitude of the 
polarisability for an arbitrary field direction. | 

Thus, the values of the polarisabilities differ for different di- 
rections. This has important consequences: The polarisability is 
uniquely related to the dielectric constant; but e determines the 
index of refraction (see Sec. 125; ¢=n’) and hence the wave propa- 
gation velocity in the crystal; as a result, the electromagnetic wave 
is propagated in the crystal with different velocities in different 
directions. 

Tetragonal and hexagonal crystals (in optics they go under the 
heading “uniaxial”) possess the following characteristic: All orien- 
tations obtained by successive rotation about the main axis are 
optically equivalent. Crystals having a lower order of symmetry 
do not possess this feature. 

Uniaxial crystals have a main direction and, perpendicular to 
it, a main plane. If the vector E points in this direction or lies in 
this plane, then P\|E (and, therefore, D||E). Analysis shows that in 
other crystals only three main mutually perpendicular directions 
in which P\|E may be distinguished. 


148. Propagation of Light in Uniaxial Crystals 


Division of a Light Field into Two Waves. We shall restrict 
ourselves to the study of phenomena occurring when light is inci- 
dent on the face of a crystal cut in two different ways, namely, per- 
pendicular to the main axis and parallel to the main axis. 

Light propagated along the main axis differs in no way from 
light propagated in isotropic bodies. The electric vector produces 
polarised oscillations of the dipoles in the direction perpendicular 
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to the main axis. Hence, the wave is propagated with the velocity 
v= <, where n, = Ve,; & Lis the dielectric constant for the direc- 


tion perpendicular to the axis. The designations n, and v, indicate that 
the index of refraction and the velocity of light are “ordinary”. 


Recalling that e=1+-4na,we find that for Iceland spar? np=Y 1 + Ana; =1.658, 
Hence, v,= 1.81 10! cm/sec. 


The passage of light through such a crystal in the direction 
of the main axis does not change its polarised state. Natural light 
remains natural, and the oscillation direction of the electric vector 
for a polarised wave does not change. 

The simplicity of the case considered is characteristic of a uni- 
axial crystal. Here, any polarised state of an incident wave is capable 
of exciting oscillations in the direction 
perpendicular to the main axis. Hence, 
the polarisability of the molecules (also 
e and n) is the same for all such oscil- 
lations. 

Now, let us consider the case of nor- 
mal beam incidence on the face parallel 
to the main axis. 

Different polarised waves behave 
differently. Consider the behaviour of a 
linearly polarised beam. If the electric 
vector is perpendicular to the axis, the 
light is propagated with the same veloc- 
ity v, as in the preceding case. But if 
the electric vector is parallel to the axis, the polarisation of the 
dipoles occurs along an axis for which the dielectric constant has 
another value, namely, e,. Therefore, for this propagation direc- 
tion, the velocity and the index of refraction have other values, 


namely, v, = < and n, = V €1, respectively. The designations n, and 


v, indicate that the index of refraction and the velocity of light are 
“extraordinary”. The reasons for the above designations will be- 
come evident below. 

Crystals for which v,<v, are called optically positive; on the 
other hand, those for which v,>v, are called optically negative. 


For Iceland spar, n,=VW1-+4m0), = 1.486 ard v,= 2,02 10" cmjsec. Iceland 
spar is an optically negative crystal since v,>Up. 
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Elliptical Polarisation. What is the situation when the electric 
vector E of the wave incident on the face of the crystal forms an 
angle @ with the direction of the main axis (Fig. 171)? Experi- 
ments show, and this may be predicted from Maxwell’s equations, 
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that the electromagnetic wave becomes divided into two parts. The 
vector E must be resolved into the components E sin g and E cos q. 
The first corresponds to a wave travelling with the velocity v,, and 
the second to a wave travelling with the velocity v,. This may be 
shown by determining the path difference between the two waves 
that are created upon division. of the incident wave. Designating 
the thickness of the crystal by /, the phase difference can be ex- 
pressed as b= 1(n,—n,). 

Thus, the polarised state of the wave leaving the crystal has 
changed significantly. The wave incident on the crystal was linearly 
polarised, while the one leaving is a combination of two waves hav- 
ing mutually perpendicular oscillation directions and displaced 
relative to each other by 6. What is the nature of this peculiar po- 
larised state? Such light is said to be elliptically polarised since the 
terminus of the electric vector describes an elliptical spiral. If the 
electric vector of one of the waves is given by 


E,, = Esing cos ot, 


then for the other wave the electromagnetic oscillation in the plane 
perpendicular to the beam will have the form 


E, = E cos g cos (wt + 6). 


The addition of such oscillations has been considered earlier (see 
p- 105). It was seen that the point describing two such oscillations 
is an ellipse. The same applies to the terminus of the electric vec- 
tor, but, since the wave is advancing, the terminus of the vector E 
describes an ellipse in its projection on the plane perpendicular to 
the beain. In space, the terminus of vector E describes an elliptical 
spiral winding about the beam axis. 

To obtain circularly polarised light by this method, a “quarter- 
wave plate” is used. This is a plate producing a path difference of 
1/4 between waves travelling with velocities v, and v,. The thick- 
ness of such a plate must satisfy the equation 


2 n 
i (ny—1,) = 5 + ma. 


If a linearly polarised beam impinges on such a plateso that the vec- 
tor E forms a 45° angle with the direction of the main axis of the 
crystal, resolution of this vector yields 


| eg E 
E,=——cos ot and E, = —~ sinaot, 
; * V2 y V2 
i.e., 


Ei+ B= > E*. 
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But this is the equation of a circle. Thus, the described experimen- 
tal conditions lead to the transformation of linearly polarised light 
into circularly polarised light. 

Double Refraction. The apparent bifurcation of objects viewed 
through a transparent crystal is a phenomenon that has been well 
known for a long time. It shows that division into two waves may 
occur not only as regards the propagation velocities, but also as 
regards the beam directions in space. Double refraction occurs for 
normal light incidence on acrystal face (ground or naturally formed), 
which is at an angle to the optic axis. The phenomenon may 
also be investigated by means of a plate cut parallel to the axis. 
In this case, the light must be incident at an angle to the normal. 
We shall direct our attention to the latter case. Let us introduce 
another restriction, namely, that the beam be directed in such a 
manner that the plane of incidence of the light is perpendicular 
to the optic axis. 

Assume that a polarised beam impinges on the plate at an angle 
i. By turning the beam about its axis, the position of the electric 
vector relative to the plane of incidence is changed. 
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When the electric vector coincides with the incident plane (see 
Fig. 172a), no special effects are noted. Refraction occurs in ac- 
cordance with the law for isotropic bodies, namely, =" =n,. 

The refractive index turns out to be n,. This is as it should be, 
for the electric vector is perpendicular to the main axis of the crys- 
tal. When the beam is turned 90° about its axis (see Fig. 1726), 


veer sini 
it is also refracted. But now — 
sinr, 


is different and the index of refraction is that for an extraordinary 
beam. This too is natural, for the vector E coincides with the direc- 
tion of the main axis. 

Most remarkable is that an intermediate position does not yield 
a beam with an intermediate angle of refraction, but yields rather 
two beams—an ordinary and an extraordinary beam having refrac- 
tive indexes n, and n,, respectively. As before, the field intensity 
vector is resolved into two vectors, one lying along the main axis 
and the other perpendicular to it. Each component creates its own 
field, or wave. In turning the beam of light about its axis, the inten- 
sities of these two beams continuously change; when one beam de- 
creases in intensity, the other increases. 

Since the beams are refracted twice, i.e., in entering and leav- 
ing the plate, the ordinary and extraordinary beams emerge paral- 
lel to each other. The thicker the plate, the greater the separation 
between beams. If a narrow beam of incident light is used, the 
difference between the refractive indexes may be determined by 
measuring the beam displacements. 

Now we can explain why the beams are called “ordinary” and 
“extraordinary”. Let us begin turning a crystal plate, whose optic 
axis is parallel to the face, about the normal to the reflecting face. 
If we were dealing with an isotropic body, such rotation could not: 
affect reflection and refraction. When we rotate the crystal plate as 
indicated, nothing happens to one beam, i.e., its position in space 
and its intensity remain unchanged. That is how an ordinary beam 
behaves. It is understandable, therefore, that the beam whose elec- 
tric vector is perpendicular to the main axis of the crystal is called 
ordinary. In this experiment, the electric vector component lying 
in the plane of incidence is always perpendicular to the main axis 
of the crystal. This component acts in an “ordinary” manner. On 
the other hand, the component of E perpendicular to the plane 
of incidence forms an angle with the main axis of the crystal 
that varies as the crystal is rotated. During such rotation, not 
only does the extraordinary beam’s intensity vary, but its po- 
sition in space varies. as well. We see that the extraordinary 
beam does ngt obey the laws pertaining to isotropic bodies. In 





=n,, i.e., the refraction angle 
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the general case, the refracted beam is not in the plane of in- 
cidence. 

We shall not go into the rather complex explanation for these 
phenomena. It should be noted however, that these phenomena are 
in complete accord with Maxwell’s electromagnetic field theory. 


149. Polarisers. Investigation of the Polarised State of Light. 


It was stated on p. 333 that a dielectric placed with its plane 
surface at an angle @, to the incident beam may serve as a polaris- 
er. In this case, the reflected beam is completely polarised and 
the refracted beam is polarised to the maximum extent possible. 
However, it is not convenient to use a reflecting plate as a polaris- 
er, for the polarised beam travels at an angle to the incident beam. 
A stack of glass plates may be used instead. By repeated refraction, 
an almost completely polarised beam may be obtained. However, 
a considerable portion of the light is absorbed in such a device (see 
Fig. 173). 





The best kind of polariser is a crystal in which the linearly po- 
larised ordinary (or extraordinary) beam may be separated out 
by means of double refraction. Such polarisers are known as Nicol 
prisms, or simply Nicols. 

The polariser proposed by the French scientist Nicol consists 
of two right-angled prisms made of Iceland spar (Fig. 174). These 
prisms are glued together with Canada balsam, a substance whose 
refractive index n is 1,550. This value lies between n, and n, for 
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Iceland spar. A nonpolarised beam of light impinging on the prism 
is divided into two components. The ordinary beam is reflected 
at the boundary between the prisms, where the condition for its 
total reflection is satisfied, and is deflected to one side. The extra: 
ordinary beam passes through both prisms. Thus, a Nicol acts as a 
slit passing only electric vector oscillations directed in a specific 
direction. 

Pleochroism is of great practical importance in Bolater: appli- 
cations. This term is used to indicate that the ordinary and extra- 
ordinary beams are absorbed differently and that the absorption 
coefficient of the extraordinary beam is a function of the direction 
relative to the optic axis. A pleochromatic’ crystal gives a dif- 
ferent hue and absorbs light differently when it is turned relative 
to the beam. 

Tourmaline is a classical example of a pleochromatic crystal. 
The absorption coefficient for the ordinary beam over almost the 
entire visible spectrum is so great that a tourmaline plate of | mm 
thickness, cut parallel to the optic axis, transmits in effect only the 
extraordinary beam and, hence, may serve as a polariser. However, 
the yellowish-green hue of the transmitted light prevents tourma- 
line from being used in practice as a polariser. 

Polaroids, synthetic pleochromatic films, have .wide applica- 
tion and may be prepared from herapathite (quinine sulphate perio- 
dide), a strongly pleochromatic substance. A polaroid is a trans- 
parent plastic film consisting of submicroscopic crystalline hera- 
pathite needles oriented in a single direction. To orient the crystals, 
a viscous mass of the crystals is placed between two glass plates 
and subjected to reciprocating motion. To be sure, herapathite is 
not the only substance suitable for the production of polaroids. 

Iodine atoms, constituents of herapathite; are essential to se- 
cure pleochromatic properties. Pure iodine polaroids may: be pro- 
duced by iodising thin films of polyvinyl alcohol. 

The polarised state of light may be investigated with the aid 
of two Nicol prisms, or other polarisers, and a “quarter-wave plate”. 
A 0.038 mm-thick sheet of mica may serve as the quarter- wave 
plate. Let us consider the passage of light through two Nicol prisms. 
In order to distinguish between the two, the first prism in the path 
of.the beam is called the polariser and the second the analyser (see 
Fig. 175). If a beam of natural light of intensity 7, impinges on 


the polariser, a linearly polarised light of intensity rh emerges 


from the prism. Naturally, turning the polariser about its axis in 
no way changes the intensity of the transmitted beam. By means 
of the analyser, it may be shown that the Nicol prism has indeed 
transmitted a linearly polarised beam. If the ‘Nicol prisms are 
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oriented with their “slits” parallel to each other, the light will 
be transmitted through the analyser as well without change in 
intensity (disregarding absorption in the material of the polaris- 
ing device). For crossed prisms, i.e., when the “slits” are at right 
angles to each other, light is not transmitted (see Fig. 175). The 





Fig. 175 


intensity of the light when an angle a is formed between “slits” 
is [= 2/,cos*a. Thus, the electric vector of the wave arriving at 
the analyser may be resolved into two components—one parallel to 
the “slit” and the other perpendicular to it. Since the component 
that is passed is E cos a (see Fig. 176), 
the intensity is proportional to cos’a. 
The first prism will already reveal 
whether or not the light was partially 
or completely polarised. 
’ Using two Nicol prisms, it is not 
possible to distinguish circularly polar- 
ised light from natural light, and ellip- 
tically polarised light from partially Fig. 176 
polarised natural light. To do this, the | 
quarter-wave plate may be used. If it is placed before the polar- 
iser, it in no way affects naturally polarised light, but circular- 
ly polarised light is transformed into linearly polarised light. 
Similarly, a quarter-wave plate changes the properties of elliptic- 
ally polarised light. 





Analyser axis 


150. A Crystal Plate Between “Crossed” Nicol Prisms 


The method of investigating transparent anisotropic substances 
by observing the behaviour of linearly polarised light impinging 
on them is very widespread. In order not to complicate the problem, 
assume that we are dealing with a crystal plate cut parallel to the 
optic axis. This plate is placed between Nicol prisms. : 

From the polariser, a linearly polarised beam impinges on the 
plate. Remove the plate and place the analyser in a crossed posi- 
tion. Light is not transmitted. Now put the plate back. The field: 
becomes illuminated, i.e., the beam of light is transmitted through 
the system. There can be only one explanation, namely, the crystal 
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plate has changed the polarised state of the beam coming from the 
polariser. By means of the analyser, we may determine the exact 
nature of the change. If upon turning the analyser a new position 
for darkness is found, the conclusion is that the crystal plate has 
changed the direction of the beam oscillations but has left them lin- 
early polarised. If the intensity of the light is not changed by 
turning the analyser, the conclusion is that the plate has trans- 
formed the linearly polarised light into circularly polarised light. 
Finally, if the light is not extinguished by turning the analyser 
or plate, but the intensity of the light is changed thereby, the con- 
clusion is that the plate has created elliptically polarised light. 

The changes produced in linearly polarised light depend on two 
things—the mutual orientation of the plate’s optic axis and the os- 
cillation direction of the beam coming from the polariser, and the 


phase difference 6 = F\(n,—n,) created by the plate between the 


otdinary and extraordinary waves into which the incident wave 
is divided. 

If the substance placed between the crossed Nicol prisms is 
isotropic, no illumination of the field occurs. The phenomenon des- 
cribed above may be utilised in the investigation of anisotropic 
substances. 

Usually, observations are made using crossed Nicol prisms be- 
tween which the plate is rotated. During such rotation, the illumi- 
nation does not remain constant. At every instant, the amplitudeA 
of the light emerging from the polariser is resolved into the compo- 
nents A cos g and A sin g, where @ is the angle between the polar- 
iser “slit” and the optic axis of the plate. The terminus of the elec- 
tric vector of the wave emerging from the plate describes an el- 
lipse: 

EX Ey 2E,E ver 
Ae costo + AF sint?g@ A? eer Geile cos 6 = sin’ 6, 





where 6 is fixed and @ changes continuously. Fig. 177 shows ellipse 
transformations for the case of a quarter-wave plate, i.e., in the above 
formula 6 =90°. For different values of g, different polarised states 
are obtained. 

Since the path difference § depends on the wavelength, the pat- 
tern is coloured when white light is used. If the plate is of uniform 
thickness, it will have a single colour that differs for every differ- 
ent relative orientation of the plate and Nicol prisms. Thus, for 
certain wavelengths of white light, the plate thickness may be 


equal to = for others it may be equal to=, and for still others a 
raultiple of 4. Accordingly, for different wavelengths there arise 
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different polarised states, which are transmitted by the analyser 
to a greater or lesser degree for different relative orientations of 
the plate and Nicol prisms. The phenomenon of chromatic polarisation 





is very beautiful. It is hardly likely that the richness of shades and 
hues observed when the thickness of a plate changes (e.g., in crys- 
tal growth), or when the orientation of a plate is varied relative 
to Nicol prisms, may be achieved 
by any other means. 

If a plate is of variable thick- 
ness, the interference fringes are 
rainbowed when observed in white 
light. 

In addition to these patterns of 
fringes representing equal thickness, 
distinctive fringes representing equal 
inclination may be observed if the 
crystal plate is observed using con- 
verging rays (iconometric investiga- 
tion). These observations may be 
made on small crystalline granules 
in the field of vision of a micro- 
scope. Their practical significance 
lies in the determination of crystal symmetry. In particular, 
it- is not difficult to determine to which of the follow- 
ing three groups an object belongs: 1) amorphous or crystalline 
substances having cubic symmetry; 2) uniaxial crystals; and 
3) crystals having symmetry of lower order. In the first case, interfer- 
ence fringes are not present. Uniaxial crystals give the pattern 
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shown in Fig. 178 when the optic axis coincides with that of the 
incident beam of light. A dark washed-out cross is formed in the 
area where either an ordinary or an extraordinary beam impinges. 
If elliptically polarised oscillations are not transformed, crossed 
Nicol prisms do not transmit light. 

The theory of this phenomenon is too extensive to be dealt 
with here. 


151. Double Refraction Due to an External Action 


As was stated at the beginning of this chapter, almost every 
molecule possesses anisotropic polarisability. An optically isotrop- 
ic body may be formed from anisotropic molecules if the latter 
are randomly distributed. On the other hand, if a preferred orien- 
tation is given to the molecules by any means, the polarisation vec- 
tor, i.e., the total dipole moment of the molecules, no longer has 
the same value in every direction, the body acquires permittivity 
anisotropy and hence optical anisotropy. 

Such anisotropy, expressed in the manifestation of double re- 
fraction, is almost always produced in nonuniformly deformed solid 
bodies, certain liquids placed in electric fields (Kerr effect), and 
streams of liquids whose molecules are of elongated form. Double 
refraction may be observed in biological objects and high-polymers, 
i.e., again in substances consisting of long molecules which cannot 
lie in a completely random manner in the substance. Generally 
speaking, to one or another degree double refraction is almost al- 
ways present since it is very difficult to make a body ideally iso- 
tropic. 

If a body is subjected to one-sided compression or extension, 
an axial type of anisotropy is produced. With respect to optical 
properties, such a body is similar to a uniaxial crystal. Optical 
anisotropy is most conveniently observed between crossed Nicol 
prisms. Lightly pressing a transparent piece of plastic or glass be- 
tween one’s fingers makes it anisotropic and, as a result, the field 
of vision is immediately illuminated. At different locations in the 
object, a nonuniform deformation produces different values for the 
difference n,—n,. Therefore, fringes representing equal phase differ- 
ence 6 are formed on a body subjected to deformation. The forms 
of these curves correspond to the stresses produced in the body. 
By analysing the nature of these curves, it is possible to obtain a 
clear picture of the stress distribution. 

Since this valuable method is applicable only to transparent 
bodies, how can it be used in practice? The approach is clear: con- 
struct transparent plastic models. By constructing a model of a 
bridge, building or machine element and loading it proportionately, 
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a picture of the produced stress may be obtained. The optical method 
of quantitatively determining stress is known as the photoelastic 
method, which is applied under the guidance of specialists in this field. 
Queer coloured patterns appear when observed in white light. 
If a deformation is elastic, the pattern vanishes upon removal of 
the load. On the other hand, if the stress remains upon removal of 
the load, the coloured pattern also remains. - 
Now, let us consider an optically anisotropic liquid located in 
an electric field. The electric field exerts an orienting action only 
if the molecules of the liquid possess a constant, rigid dipole moment. 





Fig. 179 


In this case, the molecules tend to become oriented in such a manner 
that the direction of the rigid dipole moment coincides with the direc- 
tion of the field. Thus, the liquid acquires the properties of a uni- 
axial crystal whose optic axis is parallel to E. This effect is best 
observed when the liquid and the applied field are located between 
crossed Nicol prisms (see Fig. 179). 

Experiments show that the difference n,—n, produced in the 
liquid is proportional to the electric field intensity E squared. 
The phase difference is given by § =B/E?, where / is the distance 
traversed in the liquid by the beam of light and B is the Kerr con- 
stant (characteristic of the substance). 

Nitrobenzene, whose rigid dipole moment is large, is notable 
for its large value of B: 2 10~* (Gaussian units). Benzene has a Kerr 
constant of 0.56107’ and carbon disulphide 3.5107? (Gaussian 
units). 

Example. A 10 cm-long condenser filled with nitrobenzene will act as a 
quarter-wave plate if the field intensity in it is E=7/ %2=26,690 volts/cim 


For this purpose, a potential difference of 2,660 volts must be applied to the 
condenser if the difference between plates is 1 mm. 


The Kerr effect makes it possible to transform electric field 
oscillations into light intensity variations. This effect has little 
inertia: the relaxation time, i.e., the time required for the mole- 
cules to assume the appropriate orientation in the electric field, is 
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of the order of one-thousand millionth of a second. Therefore, elec- 
tric oscillations modulated by sound may be transformed into light 
intensity variations. This makes it possible for sound to be recorded 
on photographic film. 


152. Optical Activity 


The ability of certain substances to change the oscillation di- 
rection of a linearly polarised beam is known as optical activity. 
The phenomenon may be described as follows. Consider an arrange- 
ment of crossed Nicol prisms with an optically active substance 
placed in the path of a beam. The field becomes illuminated, but 
the illumination disappears when the analyser is turned by some 
angle a. Thus, linearly polarised light transmitted through an op- 
tically active substance remains linearly polarised, but the oscilla- 
tion direction of the beam changes by the angle a. Experiment 
shows that the change in oscillation direction is strictly propor- 
tional to the thickness of the layer of substance: 


a= od. 


The constant @ characterising the substance is known as the specific 
rotation constant and is usually expressed in degrees per millime- 
tre. The phenomenon exhibits dispersion since @ is a function of 
the wavelength. Normally, @ decreases with increasing wavelength. 

The change in oscillation direction is quite considerable and 
in the case of many substances attains a value considerably greater 
than ten degrees per mm for a number of wavelengths. For water 
solutions of organic substances, the rotation of the polarisation 
plane is a function of the concentration, i.e., a@—ecd, where c is 
the concentration. 

What kind of substances are optically active? An optically ac- 
tive substance must be composed of structural units which have 
neither a plane of symmetry nor a centre of symmetry among 
their elements of symmetry. In the case of molecular. substances, 
such components are, as a rule, molecules. In the case of crystals, 
in which molecules are not distinguishable, such components are 
unit cells. 

Molecules, or cells, satisfying the above conditions, may be en- 
countered in the form of two optical isomers, designated by the let- 
ters d and / (dextro and levo, or right and left). An object and its 
image in a mirror are optically isomeric. A substance consisting 
of d-molecules (or cells) rotates light to the right, while one con- 
sisting of /-molecules rotates light to the left. By ‘rotation to the 
right we mean the case in which the analyser must be turned to the 
right (from the viewpoint of an observer facing the oncoming beam 
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of light) to restore darkness when the thickness of the layer of sub- 
stance is increased. Reversing the direction of the light does not 
change thé sign of the effect. 

Optical activity may occur for substances in the liquid as well 
as in the solid state. It is merely necessary that there be a surplus 
of d- or /-molecules. The orientation of these molecules may be eith- 
er random or uniform. In the first case, the body is isotropic and 
the rotation is the same irrespective of the direction of the beam of 
light. In optically active crystals, the magnitude of the rotation a 
is a function of the beam direction relative to the crystal axes. 

When molecular crystals that rotate light are fused, the struc- 
tural components remain intact. In such cases, the solid as well 
as liquid substance possesses optical activity. An example of this 
is sugar, which, in addition, possesses activity in solution. This 
property of sugar is utilised in the 
saccharimeter to determine the amount 
of sugar in a solution from the mag- 
nitude of the change in the oscilla- 
tion direction of a beam of light. 

The situation is different in such 

crystals as quartz (see Fig. 180). The 
arrangement of atoms in a quartz 
cell satisfies the necessary conditions, 
namely, it does not have a centre of Fig. 180 
symmetry nor a plane of symmetry. 
The molecules are not distinguishable in a quartz crystal; as a 
result, the arrangement of atoms changes upon .fusion. Hence, in 
fused quartz the necessary structural units are not present. Fused 
quartz is, therefore, optically inactive. 

One and the same substance, from the point of view of chemical 
composition, may be encountered in the optically active form and 
in the optically inactive form. This applies not only to quartz. 
The structure of an inactive variety bears little resemblance to the 
structure of crystals possessing optical activity. 

The above is quite understandable in the case of ionic and ho- 
mopolar crystals. But how can an inactive crystal be formed from 
active molecules in the case of a molecular crystal? This occurs by 
the formation of racemic crystals. A racemic mixture is a mixture 
of equal quantities of d- and /-molecules. Such a mixture does not 
rotate light since the two opposite effects are equalised. A racemic . 
crystal consists of pairs of d- and /-molecules. Every pair consti- 
tutes a centrosymmetric group of atoms. 

Optically active crystals exist in both the d- and /-form. The 
structures of such crystals are identical in the same sense that right 
and left gloves are identical. In the case of molecular crystals, this 
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means that in one case the structure is composed of d-molecules and 
in the other of /-molecules. Dextro- and levo-quartz, dextro- and 
levo-glucose, dextro- and levo-tartaric acid—all the properties of 
these substances, all the details of their structure, are the same 
except for the fact that light is rotated in different directions. 

Inorganic optical isomers (e.g., dextro- and levo-quartz) are 
encountered in nature in equal quantities. This is not the case with 
organic molecules, which are important in biology. The French 
chemist Pasteur showed that a number of microorganisms are ca- 
pable of feeding on only a specific optical isomer. 


153. Basic Theory of Optical Activity 


How is the phenomenon of optical activity explained? Before 
answering this question, we shall show that a linearly polarised 
beam is equivalent to two circularly polarised beams that are dextro-. 
and levo-rotated. 

Let us write the equations for the electric vector oscillations, 
taking into account that there is a phase difference 6 between these 
circularly polarised waves. For the dextro-rotated wave 


E{=E,cosot and Ey=E, sin at; 
for the levo-rotated wave 
E’, = E, cos (wt +6) and E, =— E, sin (at +8). 
The total. field has the components 
E,=E%+E, and E,=E,+ Ey. 

To determine the polarised state of the resulting oscillation, let 
us calculate the ratio fa for the total field. Using simple trigonomet- 
ric transformations, we obtain 

Patan sy. 
The ratio is independent of time. It is seen that we are dealing 
with linearly polarised oscillations that form an angle > with 


the x-axis. Q.E.D. 
On the basis of this conception, the phenomenon of, rotation of 
oscillation direction is quite easily understood. Rotation of the 


oscillation plane by an angle a signifies that the levo-rotated wave 


is lagging behind the dextro-rotated wave (or vice versa, depend- 
ing on the rotation direction) by the angleé. In view of this expla- 
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nation, it is clear why a discussion of optical activity has been in- 

cluded in the chapter on double refraction. Here, too, the wave is 
divided by the substance into two components, one moving faster 
than the other and continuously advancing relative to it in phase. 
From this viewpoint, the specific rotation is proportional to the 
ees in refractive index between the dextro- and levo-rotated 
eams. 

This discussion has in no way advanced our understanding of 
the phenomenon. We have merely given it another (completely equiv- 
alent) interpretation. However, this new approach enables us to 
more easily explain optical activity. Circularly polarised waves 
that are dextro- and levo-rotated travel through a substance with 
different velocities. They encounter different indexes of refraction 
and, hence, different permittivities and polarisabilities. The dis- 
placements of the electron cloud under the action of these two waves 
must differ. One wave experiences more difficulty than the other 
in displacing electrons from their equilibrium positions. If we can 
determine the reason for this difference, the explanation for optical 
activity will have been found. 

We know from chemistry that if a molecule has an asymmetrical 
carbon atom, the substance may exhibit optical activity. By an 
asymmetrical carbon atom chemists mean a carbon atom bound to 
each of four different atoms or radicals. 

The angles formed between the bonds of a tetravalent carbon 
atom are approximately equal to those in a tetrahedron. Fig. 181 
shows a molecule containing an asymmetrical carbon atom. The 
radicals or atoms bound to C differ, but their nature is unimportant. 
We see, in the first place, that two different molecules of sucha sub- 
stance, which are mirror images of each other, are possible. These 
are optical isomers and cannot be made to coincide. This may be 
easily demonstrated using wire models. 

Consider a circularly polarised wave travelling along the axis 
of symmetry of the bonds. In Fig. 182 the wave is directed outward 
from the page. Atoms A and B are higher than atoms D and E. Let 
us determine the directions of the electron displacements for dextro- 
and levo-rotated waves. Assume the situation is as follows for the 
dextro-rotated wave: When the vector E is directed along ED, in 
the upper “level” it is directed along BA. If that is the case, the 
situation is as follows for the levo-rotated wave: When the vectof 
E is directed along ED, in the upper “level” it is directed along AB. 

Examining the figures, we see that the displaced electrons be- 
have differently. In the first case, the electrons of atoms A and D 
move simultaneously away from the centre. In the second case, when 
the electrons of A move toward the centre, the electrons of D move 
away from the centre. Such differences will always be found for 
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systems of atoms that do not have a centre of symmetry and a plane 
of symmetry. On the other hand, if these elements of symmetry are 
present, the action of dextro- and levo-rotated waves will be the 
same. 

Different electron displacement conditions for asymmetrical 
groupings lead to different polarisabilities and, hence, to different 
indexes of refraction for dextro-and levo-rotated waves. It is clear 
that the behaviour of a dextro-molecule is opposite to that of a levo- 
molecule. 

A deeper theoretical analysis shows that the effect is maintained 
for all orientations of the molecules relative to the beam, 


CHAPTER XXIV 
THE THEORY OF RELATIVITY 


154. Basic Theory 


The theory of relativity, developed at the beginning of this 
century by the great modern physicist Albert Einstein, is based on 
two postulates: 1) the principle of relativity and 2) the principle 
of constancy of the velocity of light. We shall briefly consider the 
essence of these principles, describe the experiments confirming 
them, and discuss certain consequences of the theory. 

The theory of relativity originated with the questioning of the 
existence of a mechanical carrier (ether) for an electromagnetic field. 
The theory of relativity solved this problem and in this sense may 
be viewed as the perfection of electromagnetic field theory. While 
solving the problems posed by electrodynamics, the theory of rel- 
ativity went much further. Its development led to the establish- 
ment of the laws of mechanical motion at velocities close to that 
of light, to the law of the equivalence of mass and energy, and to 
new views on the nature of gravity. Since our discussion will be, 
of necessity, very brief, we are forced to dispense with an histori- 
cal narration. 

First, as to the essence of the main postulates. The principle | 
of relativity states that all laws of nature (and not only the laws 
of mechanics) are the same in all inertial systems of coordinates. 
The principle states that not a single physical experiment could 
discover special properties for one of the inertial systems. All iner- 
tial systems are equivalent. 

The second postulate pertains to the constancy of the velocity 
of light in a vacuum for all inertial systems. From this it follows 
that the velocity of light in the “receding” and “approaching” di- 
rections must be the same, i.e., that the velocity of light is inde- 
pendent of the light source and measuring instruments. 

How do these principles affect our views concerning an electro- 
magnetic field and its carrier? It is not difficult to see from the for- 
mulations of the principles that electromagnetic waves and, say, 
sound waves, are not analogous. 

Imagine a laboratory isolated from the external world, moving 
rectilinearly and uniformly relative to the stars. In this laboratory, 
measurements are made of the velocity of sound in the direction of 
motion. Theoretically, two extreme cases are possible: in one, the 
walls of the laboratory are impervious to air, so that the air is car- 
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ried along by the laboratory; in the other, the walls are pervious 
to air, the air is stationary relative to the stars, and the laboratory 
moves through the air, i.e., without carrying it along. Assume 
in these two cases that measurements are made of the velocity of 
sound. The velocities are measured by two observers—one moving 
and the other stationary relative to the stars. In each case, the ve- 
locities of sound relative to these two observers will differ. If the 
velocity of sound in air is designated by ec and the velocity of the 
laboratory relative to the stationary observer by v, then, in the case 
in which the air is carried along, the moving observer finds the 
velocity equal to ¢ and the stationary observer finds it equal to 
c-+v; in the case in which the air is not carried along, the moving 
observer finds the velocity equal to c—v and the stationary ob- 
server finds it equal to c. 

The postulates of the theory of relativity reject both variants 
in the case of an electromagnetic wave in an ether. In experiments 
with light waves, the velocity of light will be equal to ¢ for the sta- 
tionary as well as the moving observer. This means that a station- 
ary as well as a moving ether is incompatible with the theory of 
relativity. Thus, the theory of relativity rejects the possibility of 
viewing the field as a medium in which. mechanical displacements 
occur. We must conclude that electric and magnetic fields have 
a real existence. 


155. Experimental Verification of the Principle of Constancy of the 
Velocity of Light 


At first glance, the principle of constancy of the velocity of 
light seems to fly in the face of “common sense”. Therefore, before 
discussing certain consequences of the theory of relativity, it is 
desirable to describe the direct experimental evidence for its va- 
lidity. This evidence is derived from astronomical observations. 

Astronomers have discovered the existence of so-called double 
stars. A double star consists of two heavenly bodies of approxi- 
mately the same mass rotating about their overall centre of gravity. 
We have the means to measure the distance between the stars, their 
mass and their velocity; also, to determine their relative motion. 
If the velocity of light depended on the velocity of the star itself, 
the velocity of the heavenly body would be added to the velocitys 
of light when this body moved toward a terrestrial observer and 
subtracted when this body moved away from the terrestrial observ- 
er. In such a case, to the terrestrial observer, the motion during 
one half of the orbit would appear faster than during the other half. 
This effect would be detectible even if the velocity v of the heavenly 
body were one-hundred thousandth of the velocity of light c. 
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and — 
(a) c--u 
may be so considerable, even for very small values of v, that not 
only is the periodicity disturbed, but a light beam transmitted 
during motion in the “receding” direction may overtake a beam 
transmitted during motion inthe “approaching” direction. Then, rota- 
tion of the star would not be visible or would be of a peculiar nature. 
The periodic rotation of double stars may be understood only on 
the basis of the principle of constancy of the velocity of light. 

To be sure, our discussion has dealt with the motion of a light 
source, so that there may remain some doubt regarding the validity 
of the principle of constancy of velocity for the motion of an observ- 
er. Such doubt may be removed by another astronomical observa- 
tion, i.e., observation of the periodicity of the motion of Jupiter’s 
satellites. Measurements of the motion of Jupiter’s satellites may be 
made in two cases—when the light arriving on the Earth from 
Jupiter coincides with the direction of motion of the solar system 
and when it is in the opposite direction. The identicalness of the 
observations and the distinct periodicity of Jupiter’s annual motion 
demonstrate the validity of the principle of the constancy of the 
velocity of light in this case as well. 

The most important role in the development of the theory of 
relativity was played by an experiment first performed by Michel- 
son in 1881 with the aid of the interferometer described on page 397. 
This experiment consisted in the following. The locations of two 
mirrors, i.e., the arm lengths /, and /,, were selected in such a man- 
ner that the coherent beams into which a light signal is divided 
would require the same amount of time to cover the distances along 
the two arms of the interferometer. This selection is made when 
the interferometer is arranged in such a manner that one of the 
arms is parallel to the motion of the globe in its orbit. The instru- 
ment is then turned 90° and the interference fringes observed for 
possible displacement. 

The results of the Michelson experiment, which was repeated 
many times by Michelson and other investigators, are the following: 
no displacements of the fringes occur and the times required for 
light to cover the distances along the arms remain equal when the 
instrument is turned 90°. This conclusion is based on very accurate 
measurements. 

What is the significance of this experiment? Since the Earth 
moves with a velocity v+30 km/sec relative to fixed stars, the dis- 
tances covered by the two rays cannot be the same from the view- 
point of a celestial inertial observer. 

Let us examine the paths of the two rays (see Fig. 183). Of 
course, we need only concern ourselves with the portions of the path 





Thus, for a long distance /, the difference in times — 
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along which the beams travel separately. The longitudinal beam in 
the “receding” direction must cover the distance of the arm length 
1, and overtake the mirror moving with a velocity v in the same 
direction. Therefore, the distance ct, covered by the beam must 
equal /,--vt,. The time required for the wave front to reach the 
mirror is 

I 


v ° 
In the “approaching” direction, the beam covers the distance of 
the arm length 7, minus the distance covered by the approaching 


i= 





Fig. 183 Fig. 184 


instrument. Therefore, the distance ct, covered by the beam must 
equal /,—vt,. Then, 
4, 


c (: +>) 
The time t,+-1, measured in the experiment is equal to 
Ql, 
c (1-3) 
Now, let us direct our attention to the transverse beam (see 


Fig. 184). During the total time t, i.e., the time elapsed from the 
instant the beam leaves the centre of the instruments to the instant 


T= 
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it returns, the mirror is displaced as shown in the figure. Therefore, 
the distance covered by the wave is 


ct=2 V a+(F) 3 


whence, the time t is equal to 
21, 


255 fier 
‘Vira 


In the first measurement, the arms /, and /, were selected in such a 
manner that the times required for the beams to cover the separate 
paths were equal. Hence, 


21, oF. a bi ! 


TORY Se eee 
(ing) eYi-e yi-s 


However, in the second experiment, i.e., when the interferometer 
is turned 90°, there is no interference fringe displacement and the 
times remain equal even though the arms /, and /, have interchanged 
places! This is the surprising result of this experiment. 

Thus, if the first arm is longitudinal, 








l =1, 1/123, 


if the second arm is longitudinal, 


Retg/ ie. 


For v=0, /,=/,; but for v40, we obtain a remarkable result: the 
length of one and the same segment differs, depending on whether 
this segment is parallel to the direction of motion or perpendicular 
to it. The obtained result is valid for any body and for any distance 
between two points. Thus, the first consequence of the theory of 
relativity is that a body moving relative to an inertial observer 
shortens its dimension in the direction of motion. The transverse 
dimensions remain unchanged. If an observer relative to whom an 
object is stationary finds that the length of this object is /,, an ob- 
server relative to whom this object is moving with velocity v will 


find that its length is 
Lay ie 
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. Example. \f an object moves with a velocity of 1,000 km/sec relative to some 
“stationary” observer, the length of the object in the direction of motion ap- 
pears to be / divided by 1.000005. If the velocity‘of the object is 200,000 km/sec, 


then “% =1.34, 


The length of one and the same segment moving in a specific manner 
in different frames of reference, will differ. It is necessary to under- 
stand properly the relative nature of the above contraction. Take 
two rods of the same lengith/, and assume that their relative velocity 
is v. Now, assume that there are two observers—one moving with the 
first rod and the other with the second. In such acase, the first observer 


will find that his rod has a length /, and the other a length /, i. 


For this observer, the second rod will be shorter than the first. On 
the other hand, the second observer finds that the second rod has a 


length J, and the first a length /, V 1%, For him, the first rod 


is shorter than the second. 

The length of a rod (or, in general, the distance between two 
points) is a relative concept. Of all rod lengths measured in various 
inertial systems, the rest length /, is outstanding. This maximum 
length of a rod has absolute meaning. 








156. Time in the Theory of Relativity 


In the expression relating the length of a rod at rest to the length 
of arod in motion, the factor Y1—B* appears, where p=— j 
This factor also appears in analogous formulas relating the values of 
various physical quantities for stationary and moving observers. Using 
an approach similar to that taken in the preceding article leads to 
interesting results as regards time and acceleration, mass and force, 
momentum and energy, density of charge and current, field intensities, 
etc. The formulas of the theory of relativity enable us to convert 
values determined by a stationary observer to values determined 


by a moving observer. The ratio p=— is in all cases an important 


criterion of the need for a relativistic correction. 

It is easily seen that B* is comparable to unity only when the 
velocity is very large. Even when v=100,000 km sec, V1 —6? is 
only several per cent less than unity. It is, therefore, clear that fhe 
theory of relativity yields negligible corrections when the motion 
is slow, i.e., in such cases it is not necessary to take into account 
the changes in physical properties with motion. The theory of rel- 
ativity is of particular importance for the microworld, where 
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particles having velocities approaching the velocity of light are 
encountered quite often. 

Let us direct our attention to the consequences of the theory as 
regards time. It turns out that the interval t during which an event 
occurs is also not the same from the viewpoint of two different 
inertial systems. Thus, two events occurring simultaneously from 
one viewpoint occur at different times—one earlier and the other 
later—from the viewpoint of another frame of reference. . 

Qualitatively, this assertion follows immediately from the 
principle of the constancy of the velocity of light. Thus, consider 
a system moving uniformly and rectilinearly relative to another 
inertial system. In one of them, there is located a radiator from which 
light is radiated in all directions. In this system, let us select two 
points equidistant from the source of light along a straight line in 
the direction of relative motion. It is clear that in this system the 
light arrives at both points simultaneously. This is the situation 
from the viewpoint of an observer moving together with the source. 
However, to an observer in the other system the situation appears to 
be different. To this observer, one point is moving toward the signal 
and the other away. Since the velocity c has the same value for 
this observer too (moreover, the same in both directions), from his 
viewpoint the light arrives earlier at the point that is behind. 

A doubt may arise: cannot such a conclusion lead to absurdities? 
One may reason that since the concept of simultaneity is relative, 
it may happen that from the viewpoint of one frame of reference 
a gun is fired and then a wounded bird falls from a'tree, but from the 
viewpoint of another frame of reference the bird falls before the 
gun is fired. Careful analysis shows that the relativity of a sequence 
of events is limited by the velocity of propagation of interaction 
(less than c). Therefore, “earlier” and “later” may interchange places 
only when they are not causally related, i.e., when they are not 
the result of interaction. 

A very interesting result of the theory relates to the proper time 
of an object, i.e., the time determined by a clock moving together 
with a given body. If a time t has elapsed according to the clock 
of an observer in a certain inertial system, the handle of the clock 
moving with the object will have advanced by the time 


t,=tV1—f*. 


This means that a clock moving in any arbitrary manner moves 
slower than a stationary clock.* 


* This formula has found experimental confirmation in experiments with 
fi-mesons (see p. 54). 
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It is necessary to comprehend properly the relative meaning 
of this assertion. If two observers are in different inertial systems, 
each will assert that the clock of the other observer is slow. This 
would seem to be a paradox. Let us stop the observers and compare 
their clocks. However, to perform this check, at least one of the 
observers must perform a complete circuit. Upon returning to the 
point of departure, the clocks may be compared. But now the de- 
terminations of the observers have lost their relativity. The ob- 
server remaining in an inertial system is justified in applying the 
above formula. The observer executing the circuit has undergone 
accelerated motion; hence, the formula t,=tV 1— * cannot be used 
by him. Thus, after the observer executing the circuit returns and 
the clocks are compared, it turns out that his clock is slow. More- 
over, he cannot “dispute” this result by reference to the principle 
of relativity, for this principle is valid only for inertial observers. 


157. Mass 


If the mass of a body measured in a system of coordinates to 
which it is bound is designated by m,, to an observer relative to 
whom this body moves the mass appears to be 


Mo 
os er ae 


The quantity m, is known as the rest mass and the increase in mass 
with increasing velocity is a natural consequence of the fundamen- 
tal principles of the theory. The velocity of light ¢ constitutes a 
limiting velocity for any motion or transfer of interaction. For 
v=c, the mass of a body becomes infinite. Of course, the closer a 
body approaches the limiting velocity, the more difficult it is to 
accelerate it. 

The increase in mass with increasing velocity was first detected 
for the electrons of B-rays as early as the beginning of this century. 


Since the electron velocity v and the ratio <. may be determined 


independently (see p. 462), and since the electron charge remains 
unchanged, we are able to check the formula for mass. 
Corrections given by the factor VY 1—? playan important rolain 
the construction of accelerators of charged particles. The particle 
velocities attained in modern accelerators are so great that, for 
example, in one of them B reaches a value of 0.9986; thus, the 
mass becomes 60 times heavier than the rest mass. In all experi- 
ments conducted under terrestrial conditions with macroscopic 


bodies, we can disregard the V1 —? correction to the value of the 
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mass. Nevertheless, it is desirable to check its validity not only 
for elementary particles. This is possible by means of precise astro- 
nomical observations. It turns out that the change in mass of the 
planet Mercury during its orbital motion explains the small devia- 
tions of the orbit from an ellipse. 

The momentum formula acquires the following form when we 
substitute the expression for the mass of a moving body: 


MV 


= Vi-p Pe 

It spend, ae noted that Newton’s law remains valid if it is writ- 
ten as F=“. On the other hand, the formula F=ma will no longer 
be valid in all cases. 





158. Energy 


In Sec. 10, we obtained an expression for the energy of a moving 
body by finding a function that increases as the work expended in 
accelerating the body. Let us repeat these calculations taking into 
account the corrections provided by the theory of relativity. 

The work of displacing a body by a distance dl is 


Fdi= ’ di = dp“! = vdp, 


where v is the velocity seulsn If this work serves to increase the 
energy 6 of the body, then 


dé = vdp = td (mv) = medv +0°dm. 





4 mM, modv 
Since m= vie then dm= =———35 a (1 — pe) hence, 
i __ moda 
dé = (1 pea ay —55;) mv dv, i.e., dé = BR: 


Comparing the last expression with the formula for incremental 
mass, we find: d@=c*dm, i.e., 


Gé=me’. 


We have dropped the additive integration constant, for when m=0, 
6 must also be equal to zero. 

Thus, the work done on a body serves to increase the function 
&€ =me’*, which has, therefore, the significance of energy of the body. 

The fundamental result of this calculation consists in the follow- 
ing: an increase in the mass of a body is accompanied by an in- 
crease in its energy (and, hence, an expenditure of external energy); 
on the other hand, a decrease in the mass of a body or system is accom- 
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panied by a decrease in its energy (and, hence, a transfer of energy 
to its surroundings). There is a direct and universal relationship 
between mass increment and energy increment, for c? is a constant 
quantity. 

But what is the nature of the energy @? Is it an energy of motion? 
Evidently not. If the body is at rest, € does not equal zero but equals 
mc’. Therefore, U=m,c’ is the rest energy of the body, i.e., the 
internal energy of the body, and the difference mc’—m,c* is the 
energy of motion. 

The first part of the last sentence should be viewed as an asser- 
tion that may be verified experimentally. As for the energy of mo- 
tion, me’—m,c’, this will be recognised as the familiar expression 
for the kinetic energy if the following approximation is used: 








1 1 oe 
Vip Teh 
To this degree of accuracy, 
Ba: ae 2 1 — 2., 1 ge__ Mv? 
mc*—m,c* =m,c (Fac ee Xz P=. 


Example. The internal energy of a body of mass my=1 kg is U=m,c?=9X 108 
joules=2.1610'* keal. This is the equivalent of the quantity of energy that 
would be released in the form of heat in the combustion of 3 million tons of coal. 
Even in thermonuclear reactions, only several per cent of these tremendous 
reserves of internal energy are released at present. 


159. Mass Defect 


As was indicated in the preceding article, the expression relating 
rest mass to the internal energy of a body, i.e., U=m,c®, may be 
verified experimentally. 

The internal energy of a body consists of the rest energy of the 
component parts, their kinetic energy and their potential energy 
of interaction. A change in any of these component energies affects 
the value of U and, hence, the rest mass as well. Thus, the rest mass 
increases if the temperature of the body rises, i.e., if the internal 
motion of the system increases. The rest mass also increases if re- 
pelling components of the system approach one another or if attract- 
ing components move apart. 

It is clear from the above that the rest mass of a system of inter- 
acting particles does not possess the property of additivity, i.e., 
it is not subject to the law of conservation. If a body of rest mass 
M, consists of N particles, each of mass m,, then M,ANm,. The 


difference 
M,—Nm, = AM 
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is called the mass defect of the body (or system of particles). The 
quantity c?AM 


is called the binding energy. 

If a system breaks up into a number of components, binding 
energy is released and may be measured. Moreover, the rest mass 
may also be directly measured. Thus, the U=m,c* law may be 
verified experimentally. 

Numerical examples show that any change in internal energy 
related to a change in the velocity of motion and the interaction 
force between molecules and atoms cannot lead to a measurable 
change in mass. Experimental verification of this theory is possible 
in nuclear physics (see p. 537). 


Examples. 1. The mass of 1 kg of molybdenum increases by AM=0.000000003 
gm when it is heated to 1,000°C. 

2. If a steel rod of 128 cm length and 1 cm? cross-section (mass of the rod= 
=1 kg) is stretched by a force of 8 tons, the potential energy thereby stored 
in it increases its mass by 2X107™ gm. 


160. The Principle of Equivalence and the General Theory 
of Relativity 


Let us consider a noninertial system of coordinates moving with 
an acceleration a,. Assume that we wish to describe physical phe- 
nomena in this system. The laws of mechanics in this system will 
appear different than in an inertial system, for F=sna is valid only 
for the latter. A stationary body will have an acceleration —a, 
relative to this system. 

If we maintain the terminology used for an inertial system and 
assume that accelération is produced by forces, then the “force” 
field—ma, acting on all bodies in an accelerated system may be 
called an acceleration field and an analogy may be drawn between 
this field and a gravitational field. 

In exactly the same manner, we may introduce additional “force” 
fields in considering phenomena in a rotating system of coordi- 
nates and, of course, in the general case. The fictitious force fields 
that we have introduced for the description of motion from the 
viewpoint of a noninertial system of coordinates may be called fields 
of inertial forces. The force—ma, is an inertial force. 

The motion of a point having an acceleration a@ relative to such 
a noninertial system will obey the equation 


ma=F -+inertial forces. 


Expressions for inertial forces may be found in textbooks on 
theoretical physics. 
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It is important to direct our attention to the theoretical side 
of this question. In noninertial systems, fictitious force fields appear. 
To each such acceleration field there corresponds a fictitious distri- 
bution of attracting mass. Hence, any field created by accelerated 
motion may be interpreted, generally speaking, asa gravitation- 
al field. In this sense, we sometimes speak of the equivalence of 
gravitation and acceleration. 

Let us consider several simple examples. Assume that we are in 
an elevator falling with an acceleration a. Let us drop a ball and 
examine the nature of its fall. As soon as the ball is dropped it be- 
gins, from the viewpoint of an inertial observer, to fall freely with 
acceleration g. Since the elevator is falling with acceleration a, the 
acceleration relative to the elevator floor is g—a. An observer in 
the elevator can describe the motion of the falling body by means 
of the acceleration g’ =g—a. In other words, the observer in the 
elevator need not speak of the accelerated motion of the elevator 
since he has “changed” the acceleration of the gravitational field 
in his system. 

Now, let us compare two elevators. One is suspended over the 
Earth and the other moves in interplanetary space with an accel- 
eration a relative to the stars. All bodies in the elevator suspended 
over the Earth are able to fall freely with acceleration g. But bodies 
inside the interplanetary elevator have a similar capability. They 
“fall” with an acceleration—a to the “bottom” of the elevator. The 
tole of bottom is played by 
the wall opposite to the ac- i 
celeration direction. 

Thus, the action of a grav- 
itational field and the man- 
ifestation of accelerated mo- 
tion are indistinguishable. 

The behaviour of a body 
in an accelerated system of 
coordinates is the same as the 
behaviour of a body in the 
presence of an equivalent 
gravitational field. However, 
this equivalence is complete 
only if we limit ourselves 
to observations over small 
portions of space. Thus, imagine an “elevator” having linear floor 
dimensions of several thousand kilometres. If such an elevator is 
suspended over the Earth, the phenomena occurring in it will differ 
from those occurring in an elevator moving with an acceleration 
g felative to fixed stars. This is clear from Fig. 185. In one case 





Fig. 185 
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bodies fall obliquely to the bottom of the elevator, while in the 
other case perpendicularly. 

Thus, the principle of equivalence is valid for such volumes 
of space in which the field may be considered uniform. 

The above qualitative considerations lie at the basis of the gen- 
eral theory of relativity. This theory was also developed by Ein- 
stein. In it, he sought formulations for the laws of nature, indepen- 
dent of the choice of coordinate system. Until now, we have as- 
sumed that this was possible only for inertial systems of coordinates. 
The principle of equivalence shows that the absoluteness of accel- 
eration may be destroyed by a gravitational field. An accelerated 
system of coordinates may be viewed as an inertial system if we 
introduce an equivalent gravitational field. To be sure, as we have 
just seen, such equivalence is limited in time and space. However, 
Einstein showed that this restriction may be removed if, correspond- 
ing to the introduction of the gravitational field, a change in the 
geometry of the system is introduced. 


CHAPTER XXV 
THE QUANTUM NATURE OF A FIELD 


161. Photons 


On a number of occasions we have indicated that radiation and 
absorption of electrical energy occur in packets, or quanta. The 
magnitude of a quantum depends only on its radiation frequency and 
is equal to Av, where is a universal constant equal to 6.62 x 10 -*’ 
erg sec. It should be noted that the quantum nature of radiation 
and absorption has been established already for the entire electro. 
magnetic spectrum, i.e., from hard y-rays to long radio waves. 

The phenomena of radiation and absorption characterise, in 
the first place, the microsystem interacting with the electromagnet- 
ic field of a wave. The quantum nature of these phenomena (which 
we shall discuss in detail in Part III) shows that a microsystem has 
distinct energy levels and that the values of these energy levels 
cannot be arbitrary. These facts by themselves would not have led 
to the conclusion that this quantum nature is characteristic of an 
electromagnetic field as well as of matter if an electromagnetic wave 
in its interaction with matter did not behave, in a number of cases, 
as a particle. The corpuscular properties of electromagnetic radia- 
tion are manifested when losses and transformations of electromag- 
netic energy occur. The shorter the wavelength, the more distinct 
the effects. These properties, on the other hand, are not manifested 
during propagation, scattering and diffraction of electromagnet- 
ic waves if these processes are not accompanied by energy losses. 

A corpuscle of an electromagnetic field is called a photon. It 
is characterised, in the first place, by the magnitude of its energy: 


6 = hv. 


Using the law of equivalence of mass and energy, we are entitled 
to ascribe to a photon the mass 


€ hy 


a i a 


Since an electromagnetic field is propagated with a velocity e, it 


must be concluded from the formula mae ra that the rest mass 








of a photon is equal to zero. 
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Assuming the concept of momentum is applicable to a photon, 
we obtain 

hy 

p=inc= van 


It should be recalled that the Lebedev experiments (see p. 317) 
directly demonstrate the validity for light of the formula p=", 


the relationship between the momentum density and energy density 
of an electromagnetic wave. The formula for photon momentum is 
in complete agreement with this result. 


Values of ¢, m and p for Photons of Various Types of Electromagnetic 
Radiation 





P (gmcm/sec) 





| 7 | & | mm (gm) 


Radio waves .| 2,000 1072 erg = 0.62 107% ev} 1.1107% | 3.3x107% 
metres 


Visible light .| 6,000A | 3.3x107erg=2 ev 3.61078 | 1.1*1077? 
X-rays ....(1A 19.8 107% erg = 12,400ev | 2.210779 | 6.61079 


As may be seen from what follows numerous experiments convinc- 
ingly show that photons exist. On the other hand, a mass of exper- 
imental evidence prevents us from abandoning our view of an elec- 
tromagnetic field as a continuum. The sharpest contradictions arise 
in considering interference phenomena. These phenomena are ele- 
gantly explained by the wave nature of the field, but are completely 
inexplicable from the corpuscular viewpoint. 

Thus, consider a simple interference arrangement—two close 
apertures through which light may be transmitted. The following 
experiment is easily performed. First, let us photograph the trans- 
mitted light when both apertures are open. We obtain the pattern 
discussed earlier, namely, alternate bright and dark fringes. 
Now, let us close each of the apertures in succession and take 
the photograph on one plate. The result, of course (from the 
viewpoint of wave theory), will be different, i.e., there will be no 
interference. 

Let us now consider how this experiment may be interpreted 
in corpuscular terms. It is conceivable (by stretching one’s imagi- 
nation) that photons fall unequally on different parts of the photo- 
graphic plate owing to rebound from the edge of the aperture or colli- 
sion with one ancther. But the patterns obtained differ depending 
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on whether the light passes through both apertures simultaneously 
or consecutively. Photons passing through one aperture “know” 
whether the other aperture is open or closed. 

This experiment and many others show that it is quite impossible 
to reduce electromagnetic phenomena to only a field pattern or to 
only a system of photons. Each concept is exceedingly fruitfulin 
Me case of one group of phenomena, but fails in the case of the 
other. 

During the last few decades, physicists have energetically sought 
ways of reconciling these two contradictory views of electromag- 
netic radiation. A field is a reality characterised by continuous 
values of field intensity in space and time; a corpuscle is a reality 
occupying a certain limited region of space at a given instant. These 
contradictory qualities are combined in electromagnetic radiation. 
In Chapter X XVII, we shall see that these contradictory properties 
are combined not only in the case of electromagnetic radiation, but 
in the case of matter as well. However, physics has made consider- 
ably more progress in understanding matter than in understanding 
an electromagnetic field. The dual nature of particles of matter is 
described by the Schrdédinger equation (see p. 472); interactions 
between corpuscles and waves for such particles are understood 
quite well. 

Unfortunately, the situation is much worse as regards electro- 
magnetic field (radiation) theory, commonly referred to as quantum 
electrodynamics (for a detailed discussion, see p. 543). Such a com- 
plete theory does not exist. In view of the fundamental contradic- 
tions existing in quantum electrodynamics, its partial successes, 
expressed in the establishment of new relationships between field 
and particles, cannot be generalised. Hence, the interrelation be- 
tween photons and electromagnetic field remains unclear. 

The rules of “translation” from corpuscular terminology to 
wave terminology, and vice versa, are based on the following: An 
electromagnetic wave of length 4 and intensity / may appear as a 


stream of photons of frequency v= + and intensity / =Nhv, where NV 


is the number of photons passing per unit time through unit area. 
The direction of motion of the wave front is the direction of motion 
of the photon. 

We shall not discuss in corpuscular terms the very complex 
problem of the polarised state of light. To do this, it is necessary 
to assume that a photon has a selected direction, or spin (see p. 493 
regarding electron spin). 
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162. Photoelectric Effect 


The escape of electrons under the action of electromagnetic 
waves constitutes important confirmation of the indispensability 
of the corpuscular viewpoint. This phenomenon will be considered 
here from this viewpoint, and again in Sec. 274 when we discuss 
the action of light on metals and semiconductors. 

Since the escape energy of an electron from a metal (see p. 689) 
is not less than 2.2 ey, the photoelectric effect becomes possible 
when Av>3.5 x 107" erg, i.e., for frequencies of the order of 0.5 x 10° 
cycles (A=6,000 A). 

Einstein proposed that the photoelectric effect be viewed as an 
effect of collision between a photon and an electron. In this process, 
the photon gives up all of its energy and ceases to exist. If A rep- 
resents the work function of the electron, i.e., the work required 
to overcome the binding force between the electron and the sub- 
stance, the law of conservation of energy has the form 


fv= ALE, 


where me is the kinetic energy of the photoelectron, the electron 


dislodged from the substance. 

The first means of, checking the validity of the photon hypothe- 
sis consists in verifying the linear dependence between photoelec- 
tron kinetic energy and frequency of incident radiation. 

The photoelectron energy is determined by the bias potential 
method. If the surface of the substance from which the electrons 
are dislodged constitutes a condenser plate, current flows through 
the circuit in which this condenser is connected. Current ceases to 
flow when an appropriate bias voltage is applied to the condenser. 
This condition is given by 

mu? 

ey, = 7 . 

It should be realised that the greater the depth from which the 
electrons are dislodged, the smaller the velocities. Therefore, cur- 
rent ceases to flow when the electrons closest to the surface are pre- 
vented from escaping. By experimentally determining eU, for var- 
ious frequencies v of electromagnetic radiation, curves of U, vs. v 
may be plotted. The ideal straight lines obtained are shown in Fig. 


186. The slope = of the straight lineeU, —hv—A may be calculated 


from other data, providing another independent means of checking 
the validity of the theory. 

Nevertheless, the above experiment cannot be considered direct 
proof of the photon hypothesis. The possible objection is that the 
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photoelectron may gradually accumulate the energy transmitted 
to it by an electromagnetic wave. This objection was answered by 
the classical experiment of A. F. Yoffe and N. I. Dobronravov. 
Earlier, Yofle had investigated the photoeffect by means of a par- 
ticle of dust suspended between the plates of a condenser. Owing to 
inevitable air friction, 

the particle of dust Us,volts 

carries a charge and, 
hence, its weight may be 
counterbalanced by an 
electric field. For equi- 
librium gE —mg, where 
m is the mass and q is 
the charge of the parti- 
cle of dust. In the photo- 
effect process, the par- = 
ticle sf duet loses an alec: ug 4 ysIO"® 16 
tron and, hence, depend- Fig. 186 

ing on the sign of gq, 

changes its charge by g-+e or q—e. The particle of dust is then no 
longer in equilibrium and begins to move towards one of the con- 
denser plates. To counterbalance the dust particle, it is necessary 
to change the field. The equilibrium condition is now 


(q +e) E,=mg. 





In this manner, Yofie determined the charge of an electron. 

Now, let us describe the Yoffe and Dobronravov experiment. 
Here, too, the behaviour of a dust particle suspended between the 
two plates of a condenser was observed, but now the goal was dif- 
ferent. The anode of an X-ray tube served as one of the condenser 
plates. A voltage of 12,000 volts was applied to the tube and the 
X-rays were created by an exceedingly weak electron stream of about 
1,000 electrons per second. 

As is well known, X-rays are created when an electron strikes 
an anode. But what is radiated by the anode? Is it a continuous 
electromagnetic field or 1,000 photons per second? The dust particle 
between the condenser plates enables us to obtain the answer. The 
X-rays dislodge electrons from the dust particle. But how do they 
do this? 

The Yoffe and Dobronravov experiment showed that, on the 
average, one electron was dislodged from the dust particle every 
30 minutes. If the X-rays were propagated in the form of a contin- 
uous field, then at each instant the dust particle would have ob- 
tained a very minute amount of energy, insufficient of course to 
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dislodge an electron. This energy would have been evenly distribut- 
ed among all the electrons of the dust particle. From the wave 
viewpoint, a quite inconceivable conclusion would have to be drawn 
from the Yoffe and Dobronravov observations, namely, that once 
every 30 minutes all the electrons transfer energy to one electron, 
which then escapes from the dust particle. 

The photon hypothesis not only explains the phenomenon qual- 
itatively but quantitatively as well. The dust particle in the above 
experiment consisted of a bismuth spherule having a radius of 3x 107° 
cm. It was located at a distance of 0.02 cm from the anode, from 
which X-rays emerged in all directions. The probability of a pho- 
ton striking the dust particle is ot — Teo Sincein 1 
second 1,000 photons are dislodged, on the average 1 photon 
will strike the dust particle every 1,800 sec (30 minutes), which 
agrees with the experimental result. 


163. Fluctuations in Luminous Flux 


The experiments of S. I. Vavilov devoted to the study of fluc- 
tuations in luminous flux of low intensity provide important ex- 
perimental corroboration of the photon theory. 

It turns out that the eye’s threshold of sensitivity to light is 
exceedingly low. The human eye is capable of perceiving approxi- 
mately 100 photons per second falling on the cornea. If the luminous 
flux fluctuates about this value, light will not be perceived by the 
eye when the number of photons drops below the threshold value. 

In the Vavilov experiments, the investigator observed a beam 
of light that was discharged every second for a time interval of 0.1 
sec. When the value of the luminous flux exceeded the threshold 
of sensitivity, the eye perceived every flash of light. When the 
light intensity was decreased, some of the flashes were no longer 
perceived by the observer. The lower the light intensity, the greater 
the number of flashes that were not perceived. Thus, fluctuations 
in the number of photons in the luminous flux were directly ob- 
served. It is difficult to provide more direct evidence of the corpus- 
cular nature of light. 

Other experiments performed by Vavilov clearly show that 
such typically wave phenomena as interference cannot be explained 
by the photon hypothesis. Using a Fresnel biprism, Vavilov divided 
a beam of light into two coherent components. These components 
yielded an interference pattern. At the same time, fluctuations in 
both beams of light were completely independent. This circum- 
stance again shows that it is quite impossible to explain interference 
as some statistical distribution of photons. 
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Wave properties are inherent in every photon rather than in a 
stream of photons. Thus, a photon can in no way be viewed as an 
“ordinary” particle. 

At this point, we must digress somewhat. In creating a model 
of the invisible world, we endow elementary particles with properties 
borrowed from the world of things (materials) around us, or, as they 
say in physics, from the macroworld. Thus, for example, atoms are 
conceived as spherules. Needless to say, an atom spherule only par- 
tially reflects the properties of a material spherule. Everyone knows, 
for example, that such properties inherent in a material spherule 
as colour, roughness and odour cannot be transferred to an atom sphe- 
rule. The more we penetrate into the microworld, the more difficult 
it is to endow elementary particles with material properties. 

Components of an atom or atomic nucleus and particles of light 
resemble a material spherule even less than an atom does. In the 
case of a photon, we saw that it is possible to combine in a micro- 
particle conflicting properties of the macroworld. Of course, in the 
macroworld, a particle is a particle and a wave is a wave. A particle 
occupies a limited region of space and travels along a definite path. 
A wave is distributed continuously in space and the energy is trans- 
ferred to one or another region from all points in space. For mate- 
tials, these two views are irreconcilable. But we have no right to 
impose the behaviour of materials on particles of the microworld. 

Cognition of the microworld does not consist in the creation 
of a model resembling the pictures familiar to the human eye. The 
infinite process of cognition consists in the investigation of the 
regularities of phenomena, the determination of objectively existing 
causal relationships between phenomena. In this manner, a complex 
picture of the microworld, whose essence cannot be transmitted by 
any ingenious model borrowed from the macroworld, is obtained. 


164. Kirchhoff’s Law 


It has been experimentally established that two bodies having 
different temperatures tend to equalise their temperatures even 
when the bodies are in a vacuum. The energy exchange occurs by 
Hers of electromagnetic waves radiated by the atoms of these 

odies. : 

' As was indicated above, a specific system of energy levels is 
associated with every atom. When an atom absorbs energy, its 
energy level rises; when it radiates, its energy level decreases. Dur- 
ing every radiation process, an atom releases into space an electro- 
magnetic energy hv,,,,=C,—G6,, where €6,, isthe energy level before 
radiation and @,, the level after radiation. The radiated wave has a 
frequency v,,,- This wave arrives at the other body and is absorbed 
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by it. In this case, the energy level of the atom absorbing energy 
is raised from 6, to @,- 

The same thing may be expressed in terms of photons. Thus, it 
may be stated that during every radiation process a photon of 
electromagnetic energy Av is released; during absorption the photon 
is captured by an atom and its energy serves to raise the energy 
level of the atom. 

All the atoms of the bodies participate in the energy exchange— 
sometimes a photon is absorbed, sometimes a photon is radiated. 
Depending on the random circumstances, the most varied energy 
transitions may occur and, in principle, electromagnetic waves of 
any wavelength may participate in the energy exchange. 

Let us assume that the bodies participating in the heat exchange 
form a closed system, i.e., the system of bodies under observation 
is surrounded by an envelope that prevents radiation from passing 
through. Then, after a certain interval of time, these bodies reach a 
state of equilibrium and assume the same temperature. This does 
not mean that electromagnetic radiation ceases. As before, a tran- 
sition will sometimes occur to a higher energy state of an atom and 
sometimes to a lower. But if the equilibrium state has been reached, 
then for each body, at each instant of time, equal quantities of 
energy will arrive and leave. This is true for radiation of any wave- 
length. In general, the radiation arriving at a body is only partial- 
ly absorbed, raising the energy levels of its atoms. The other part 
of the incident radiation is scattered, i.e., reflected, by the body. 

Atoms do not maintain their high energy levels long; in returning 
to their original state, they give up the absorbed energy in the form 
of radiation. If the energy incident on a unit area in ! sec is designat- 
ed by @, the absorbed energy may be expressed as Ag. The dimen- 
sionless coefficient A, indicating the fraction of energy that is ab- 
sorbed, is known as the absorptivity of the body. Evidently, if 


Ag=6, 


where @ is the energy radiated from 1 cm’ of surface in 1 sec, the body 
is in equilibrium with its surroundings and its temperature does not 
change. 

But what is the condition for thermal equilibrium of many bodies, 
which may have, of course, different absorptivities and different ra- 
diations? On the basis of thermodynamical considerations, Kirch- 
hoff showed that equilibrium is possible only if the intensity of the 
electromagnetic waves incident on a body is the same for all portions 
of the bodies in equilibrium with one another. Thus, 
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This relationship is known as Kirchhoff’s law and is valid for any wave- 
length and any temperature. It states that the ratio of the emissive 
power of a body to its absorptivity is a constant for any wavelength 
and temperature. 

This means that a body that is a good absorber of certain rays 
is also a good radiator of these rays, and vice versa. Why does the 
temperature of water in a bottle coated with silver rise slowly, and 
the temperature of water in a dark flask rise rapidly, under the action 
of solar rays? In the first case there is little absorption of solar ener- 
gy, while in the second there is considerable absorption. Now, let us 
assume both vessels are filled with hot water and placed in a refrig- 
erator. The water in the dark flask cools much more rapidly since 
the better absorber is also a better radiator. 

A striking experiment may be performed with coloured ceramic. 
If the colour of a body is, for example, green, it will not absorb green 
light. Thus, if we heat a green crock, it is seen that it begins to as- 
sume a colour complementary to green. 

It should not disturb us that we have applied a law established 
for equilibrium to phenomena involving bodies clearly not in equi- 
librium (the body is at a higher temperature than its surroundings). 
The situation here is exactly the same as in the case of other thermo- 
dynamic problems (cf. p. 163): the laws of thermodynamics are appli- 
cable if every instantaneous state may be viewed as an equilibrium 
state. .In thermal radiation phenomena, this condition is always 
satisfied. 


165. Black-Body Radiation 


Kirchhofl’s law has an interesting consequence. Bodies exchanging 
heat by means of radiation receive (for given values of v and 7) 
the same electromagnetic wave intensity from their neighbours, in- 
dependent of the material and properties of the bodies. For every 
wavelength (or, what amounts to the same, every frequency) and for 
every temperature, experiments yield a universal value for @. Thus, 
there exists a universal function @ (v, 7), i.e., a function of the ra- 
diation frequency and temperature, characterising the process of 
thermal exchange by radiation. 

The meaning of the function @ (v, T) is easily explained. Consider 
a body absorbing 100 per cent of the energy incident on it for all 
wavelengths. For such a perfectly black body, A ~1 and 


C= Q (v, fae 


The function @ (v, 7) is the emissive power of a perfectly black body. 
But what kind of body absorbs light of all wavelengths? Of course, 
substances such as lampblack (soot) are almost perfectly black. 
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However, all such substances fall short by several per cent of the con- 
dition A =1. A more ingenious solution exists. Imagine a box having 
a small aperture. If the aperture’s dimensions are made suificiently 
small, it may be made perfectly black. This property of apertures is 
well known from everyday observations. A deep hole, an open win- 
dow nonilluminated from within the room and a well are examples 
of perfectly black “bodies”. It is clear what happens in these cases: 
A beam entering a cavity through an aperture is able to emerge only 
alter repeated reflections (see Fig. 187). But with each reflection, 
part of the energy is lost. 
Therefore, in the case of a 
small aperture and a large 


Radiation intensity 


ws 





Fig. 187 Fig. 188 


cavity, the beam is unable to emerge, i.e., it is completely ab- , 
sorbed. 

To measure the emissive power @ (v, T) of a perfectly black body, a 
long tube made of refractory material is placed in an oven and heat- 
ed. Through an aperture in the tube, the nature of the radiation is 
studied by means of a spectograph. The results of such experiments 
are shown in Fig. 188. The radiation intensity is plotted as a function 
of the wavelength for several temperatures. It is seen that the radia- 
tion is concentrated in the relatively narrow spectral interval of 1 
to 5u. Only at high temperatures do such curves take in portions of 
the visible spectrum and begin to advance in the direction of short 
waves. Waves whose wavelengths are several microns long are 
called infrared waves. Since for ordinary temperatures they are the 
main carriers of energy, we call them heat waves. 

The higher the temperature, the more distinct the maximum of 
a thermal radiation curve. With increasing temperature, the wave- 
length 4,, corresdonding to the maximum of the spectrum is displaced 
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in the direction of shorter wavelengths. This displacement obeys 
the Wien law, which is easily established experimentally: 


2,886 
Na me . 


In this formula, the wavelength is expressed in microns and T in 
degrees Kelvin. The displacement of radiation in the direction of 
shorter wavelengths may be detected when a metal is heated. As the 
seein increases, the colour of the heat changes from red to 
yellow. 

We call the reader’s attention to another feature of the radiation 
curves, namely, it will benoted that all the ordinates increase sharp- 
ly with increasing T. If @, is the intensity for a given wavelength, 
the total intensity of the spectrum is expressed by the integral 


co 


R=\ 6d). 


tt) 


This integral is simply equal to the area under the radiation curve. 
Exactly how rapidly does R increase with increasing 7? Analysis 
of the curves shows that it increases very rapidly, namely, propor- 
tionally to the fourth power of the temperature. 


R=oT" ergs/ cm? sec, 


es o =5.7 x 10-* (Gaussian units). This is the Stefan-Boltzmann 
aw. 

Both laws are important in the determination of the temperature 
of hot bodies at great distances. In this manner, the temperature of 
the Sun, stars and the ball of fire of an atomic explosion are deter- 
mined. 

The laws of thermal radiation are basic to the determination of 
the temperature of smelted metal. The operation of optical pyrome- 
ters is based on the selection of the heating for an electric bulb fila- 
ment in such a manner that the luminosity of this filament becomes 
the same as the luminosity of the smelted metal. We make use of 
the following law: if radiations are the same, so are the temperatures. 
As for the temperature of the heated filament, it is directly propér- 
tional to the electric current flowing through the filament. Hence, 
it is not difficult to calibrate an optical pyrometer. Since actual 
bodies are not perfectly black, it is necessary to introduce in each case 
in the Stefan-Boltzmann formula a factor less than unity (the absorp- 
tivity of the given body). These factors are determined empirically 
and are significant in heat engineering where problems of heat ex- 
change by radiation are extremely important. Nevertheless, the 
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above laws are valuable since the general behaviour of radiation 
(dependence on temperature and wavelength) is maintained for 
bodies that arenot black as well. The theoretical aspects of black-body 
radiation are discussed in the following article. 


166. The Theory of Thermal Radiation 


Let us consider a cavity within which absorption and radiation of 
electromagnetic waves occur. It is immaterial whether this cavity 
is in the form of a sphere or a rectangular parallelepiped. The walls 
of the cavity radiate and absorb equal quantities of energy, i.e., 
the entire system is in equilibrium. Within the cavity there is an elec- 
tromagnetic field which is in equilibrium with the walls: at all 


points the energy density of the field, w= 2 (E*+H%, is constant in 


time. . 

This electromagnetic field may be viewed in two different ways. 
From one viewpoint, there are standing electromagnetic waves in 
the cavity, just as there are standing sound waves in a closed room 
with sound sources. From the other viewpoint, in view of the quan- 
tum nature of the field, it may be stated that the space under consid- 
eration is filled with photons, just as a vessel containing gas is 
filled with molecules. 

From the wave viewpoint, the number of frequencies of electro- 
magnetic oscillations occurring in the cavity may be easily deter- 
mined. The reasoning used for sound waves (see p. 135) is completely 
applicable here too. The number of characteristic frequencies of elec- 
tromagnetic oscillations less than v is equal to 

4 8 V 

3 te" 
where ¢ isnow the velocity of electromagnetic waves and V is the vol- 
ume of the cavity. This formula gives the number of oscillations 
for the case of linearly polarised waves. In the caseof thermal radia- 
tion, we are dealing with nonpolarised oscillations, which may be al- 
ways resolved into components along two axes. 
; Here, the number of oscillations is twice as large and is equal to 


aay V. Differentiating, we obtain the number of oscillations in 


the frequency interval from v to v+dv: 


2 
Baw Vdyv. 
z 





Now, let us consider the situation from the “other side of the coin”. 
From this viewpoint, the cavity is filled with oscillations of frequency 
v—in other words, with photons of energy e=hv. The expression 
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Sav Vdv may be viewed as the number of photons in the cavity, 


and Sav dv as the density of the photon gas. We shall soon be able 
to answie the following important question: What is the electromag- 
netic energy density in the cavity? If photons of all energies were cre- 
ated i Saal BUND ERS: it would merely be necessary 1G uliiply 





iene dv. However, the particle energies are not distributed uni- 
formly. Therefore, the formula eb: sought has the form 


widv = cae 





where W(e) is the probability of a photon of energy ¢ being created. 
Thus, the electromagnetic energy density for waves (photons) 
of frequency v is given by the formula 


7 2 
WwW, = oa eW (e). 


The energy flux through a unit area, i.e., the Poynting vector 
‘K, is e times w, (see p. 322). But the energy flux @ radiated from a unit 
area of a body in equilibrium with the field is one-fourth of the val- 


ue of the Poynting vector: e=tk. Thus, between w and e, the fol- 
lowing relationship exists: Q= tw. What is the origin of the coel- 


ficient 4? Since, on the whole, this is not a very important matter, 


the following simplified explanation should suffice. 
Every unit area radiates an energy flux @ in all directions within 
the limits of a hemisphere, i.e., a solid angle 20. Thus, the average 


radiation within a unit solid angle is equal to s. From geometry 


considerations, it is clear that the radiation is equal to zero in the 
plane of a unit area and a maximum along its normal. If the decrease 
in radiation intensity were uniform, to obtain the average value 


x it would be necessary for the radiation along the normal to be 


equal to 2. 
Now, consider a sphere filled with radiation. At the centre of the 


sphere there is a unit area through which there is an energy flux KX. 
On the other hand, however, the radiation falling on this area from 


all parts of the sphere is equal to “ x<4n. Hénce, o= 4 K, 
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Thus, using the formula for the volume density of electromagnet- 
ic radiation, we obtain an expression for the emissive power of a 


black body by multiplying w, by +: 


Qa? 
Cc 





0, = eW (e). 


Further investigation of this function involves evaluation of 
We), the energy distribution probability. Historically, the first 
formula for @, was proposed in 1911 by Rayleigh and Jeans independ- 
ently of each other. It has the following form: 


QnkT 2 
yo G2 





This formula was obtained by assuming uniform distribution of ener- 
gy per degree of freedom, i.e., W independent of «. It is valid for 
long wavelengths and high temperatures. 

Another possibility for W(e) is to use the Boltzmann law, which 


was so successful in the case of molecular gases. Then, W(e)=e # 


However, as may be seen from Fig. 189, both the Wien emissive pow- 
er formula, 
hy 


Qnv? cates 
A= e hve *, 





and the Rayleigh-Jeans formula do not agree with experimental re- 
sults. 

Where is the fallacy in reasoning in these cases? It must be sought 
in the inapplicability of the statistical reasoning lying at the basis 
of Boltzmann’s law to an aggregate of photons. As we have already 
emphasised, photons give us a one-sided picture of an electromagnet- 
ic field. The reality of the field cannot be completely represented 
by a collection of particles. It is, therefore, natural that photons 
should have their “own statistics’ (Bose-Einstein statistics). 

To obtain the new function of particle energy distribution replac- 
ing the Boltzmann law, all we need do is take proper account of the 
fact that the wave nature of the field makes the concept of a dif- 
ference between identical particles meaningless. Bose-Einstein sta- 
tistics is founded on this new basis (see p.678). It leads to the following 
law of photon energy distribution: 


1 
hy + 


ée 





W (e)= 


kT oy 
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with experiment) 





Per cent deviation 
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Therefore, the formula for the emissive power of a perfectly black 
body has the following form: : 





__ 2nv? hy 
yo G2 hy 
err | 


This formula was first obtained by Planck and is named after him. 
The excellent agreement between this theoretical formula and exper- 
imental results, and the nature of the deviations of the Wien and 
Rayleigh-Jeans formulas, are illustrated in Fig. 189. 

The Wien and Stefan-Boltzmann laws considered above follow 
from Planck’s formula. To prove the first of these laws, it is necessary 
to solve the problem for an extremum, i.e., find the root of the equa- 
tion 

dQ, 


ov 
To prove the second law, it is necessary to find 


j Q, dv. 


We leave these calculations to the reader. 


PART THREE 
STRUCTURE AND PROPERTIES OF MATTER 


CHAPTER XXVI 
‘STREAMS OF CHARGED PARTICLES 


The simplest form of matter is an aggregate of charged elementary 
particles—electrons and ions. Systems of charged particles are encoun- 
tered in the form of beams of particles in which all the particles have 
a common velocity and move in a single direction and in the form of 
a gas in which the particles move randomly. Intermediate states 
are, of course, also possible. In this chapter, we shall consider the bas- 
ic physical phenomena of such systems and describe equipment in 
which beams and gases of charged particles are used. Problems of 
electron emission, directly related to the physics of solid bodies, will 
be discussed in Chapter XXXVII. 


167. Motion of Charged Particles in Electric and Magnetic Fields 


A force f=eE+-— [vB] is exerted on a charged particle in an 


electromagnetic field (see p. 264). If the fields E and Bare given as 
functions of coordinates and time, and if the initial velocity and lo- 
cation of the particle are known, then for a particle moving with a 
velocity v<c the particle trajectory r(¢) may be determined from the 
fundamental law of mechanics: 


ad’r 
maps. 


It is usually mathematically difficult to obtain an exact solution to 
this problem. An idea of the general nature of motion in a field may 
be obtained from an examination of the motion of a charge in a uni- 
form field. 

A Particle in an Electric Field. Assume that a particle enters a 
field at an angle90°--a (see Fig. 190). For the choice of coordinates 
shown in the figure, the equations of motion take the form: 

dvy é du 
7 eee and i = 9 
Whence, 
e 


0, EL Gey and 0, == Uo,- 
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Integrating again, and assuming «=O for ¢=0, we obtain 


1 
=—55 Et? + vo,t and x= Vogts 


Eliminating, we obtain an equation of a parabolic curve which de- 
scribes the motion of the electric charge (dotted in Fig. 190). 

If the particle enters the field at 
tight angles (v,,=0), its path is de- 
scribed by the “equation 

0 

If the particle enters the field along 
a line of force, it will continue to 
move along the line of force with an 
acceleration <E. 

Fig. 190 Designating the potential difference 

between the initial and final posi- 

tions of the charged particle by V and using the kinetic energy 
equation, we obtain 





eV =F (vu? —v}). 
If the final velocity vS>v,, then 
eV= a and v= V 220. 


This equation helps to make clear why the unit of energy known 
as the electron-volt is widely used: 


1 ev=1.63x 10" erg. 


An electron-volt is the work done in moving an electron through 
a potential difference of 1 volt. This unit may be conveniently em- 
ployed when the energy refers to a single elementary particle. The 
work of ionisation and the dislodging and escaping of an electron 
from a metal range from several to several scoré of electron-volts. 

A Particle in a Magnetic Field. The properties of the force acting 
on a charged particle in amagnetic field are well known (see p. 265). 

Assume a particle enters the field with an initial velocity v,. Let 
us resolve this vector into the components v, and v,’, which are paral- 
lel and perpendicular to the field, respectively. Then, for motion in 
the plane perpendicular to the field, we obtain 





ma=-~ vB. 
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In the longitudinal direction, the particle will move uniformly with 


a constant velocity Oy: 
Vv, 
Motion in the perpendicular plane is circular, anda= = 


is ae centripetal acceleration. Thus, 


mov? 
ee a 
c =0 B= R 








Hence, R= ay i.e., the radius of curvature is directly proportion- 


al to the particle velocity and inversely proportional to the magnet- 
ic induction. It should be noted that all particles of a given kind in 


a given field will have the same angular velocity, i.e., o=2, 


Irrespective of the magnitudes and directions of the velocities, the 
particles will have the 
same frequency of revo- 
lution about a flux line. 
If a particle enters the 
field at an incline to the 
direction of the field, it 
will move with a frequen- s 
cy @ in a spiral of radius Fig. 191 
R (Fig. 191). Knowing 
v, the projection of the velocity on the direction of the flux lines, 
we may determine the pitch of the spiral: 





Q0 2Q0me 
Z=0,T=0, x= eB 





It is significant that the quantity v,; =v, cosa, where a is the angle 
formed between the initial velocity vector and the direction of the 
field, is constant to a high degree of accuracy even when the angular 
spread of the initial velocities is 5-10° (0, will vary by no more than 
1 per cent in such a case). Therefore, every z centimetres such a di- 
vergent beam of charged particles will converge in a point, i.e., 
focus (within the indicated limits) on a generating line of the cyl- 
inder on which the spiral trajectory may be viewed as winding. This 
generating line passes through the point at which the particle entets 
the field. 


Example. Assume that an electron, after being accelerated by a voltage 
V=300 volts, enters a magnetic field of flux density B=500 gauss at an angle 
a==30°. The velocity of the electron is 


2eV 2x4.8X10-X1 9 
t= yf 2 =f et Sipe = 10° cm/sec. 
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Note that Oh ede It is pointless, therefore, to introduce a relativistic correc- 


tion in this calculation. 


0 =v, cos a=0.87%X 10" cm/sec and v1; =0.5X 10° cm/sec. 


The radius of the cylinder on which the spiral trajectory of the electron may 
be viewed as winding is 


mUvio 
—  t _ 9x10-* «0.5 109X383 1019 56 : 
R= eB 4.8X10—1x 500 = 0,056 cm, i.e., 








its diameter is somewhat greater than a millimetre. The angular velocity is 


—@B _ 4.8 10-1x500 10 4 
O = ne = EXlO= Bax low — 0-89 x 10"? rads jsec. 





23.14 


Deexiow = 9.6 cm. 


The pitch of the spiral trajectory is z= 0 T=0.87 10° 


168. Ionised Gas 


The transformation of an atom into a positive ion, i.e., the remov- 
al of an electron from an atom, may be achieved in a variety of 
ways. Collision with an electron, another atom or a molecule, absorp- 
tion of a photon—all these methods of transferring energy may result 
in the ionisation of an atom if, thereby, sufficient energy is transferred 
to overcome the binding force between the electron and the atom. 
In the case of diferent atoms and molecules, this energy ranges from 
4 to 25 ev (seep. 499). This means that an atom may be ionised by an 
eleciron accelerated by a voltage of 4 to 25 volts. A particle posses- 
sing more energy can, of course, transform as many atonis or mole- 
cules into ions as its energy reserve permits. One electron accelerated 
in a powerful accelerator is capable of creating millions of ion pairs. 

Ionisation of atoms involves the tearing away of electrons. Ioni- 
sation of molecules may also involve the removal of electrons, but 
in some cases upon ionisation a molecule may break up into two 
large ions. Thus, ionisation not only creates electrons and positive 
ions, but may alsoresult in the formation of negative ions. However,. 
negative ions are also obtained by another method, namely, the at- 
tachment of a free electron to a neutral atom. It transpires that 
such a process results in the release of energy (see p. 506). 

Ionisation processes in solid bodies will be discussed in detail in 
Sec. 258. Here, the ionisation process interests us only as a method 
by means of which streams of ions may be created. From this view- 
point, only the ionisation of gases is of interest. If we wish to obtain 
a stream of ions of a substance which exists under.normal conditions 
in the solid or liquid state, it is necessary to resort to vaporisation. 

Let us consider the ionisation produced in a stream of gas particles. 
If the source of ionisation is removed, the positively and negatively 
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charged particles will begin to recombine into neutral atoms or mol- 
ecules. Since upon recombination two particles come together, it 
is clear that the rate of recombination is proportional to n?, the 
square of the number of ions in a unit volume. If An is the number of 
ions transformed into neutral particles per unit volume in a unit 
time, then An =yn?, where y is a coefficient of the order of 10-°cm*sec~' 
for most gases under normal conditions. Under the continuous action 
of an joniser, an equilibrium is established between the ionisation and 
recombination processes. Assume that the ioniser creates N ion pairs 
per unit volume in a unit time. At first, the number of ions in the gas 
increases, but since recombination occurs more and more frequently 
(proportional to the square of the number of ion pairs present), the 
increase in ionsceases when N =An,i.e., N=yn’". If an ionised gas 
fills a certain volume, and if the motion of the gas particles is pre- 
dominantly random, such a conducting gas consisting of neutral and 
charged particles is called a plasma. 

A gas may be brought to an ionised state and to a plasma state by 
increasing its temperature. The thermal ionisation of a gas begins 
at a temperature of the order of 6,000°. The average energy of molec- 


ular motion, = kT, becomes sufficient to provide enough molecular 


collisions with the requisite energy to dislodge an electron or pro- 
duce other ionisation. 

The degree of ionisation depends on the temperature and pressure 
of a gas. With increasing pressure, ionisation decreases. For tempera- 
tures of tens of thousands of degrees, ionisation is close to 100 per 
cent at a pressure of 107*mmof Hg. Such a thermally ionised gas is 
a plasma in astateof thermodynamic equilibrium with its surround- 
ings. The average kinetic energy of every neutral and charged par- 
ticle in a plasma is the same, and it determines the temperature of 
the plasma. Such plasmas exist in high-temperature stars. 

The ionosphere is an important example of a highly ionised gas. 
The number of charged particles in a unit volume of the ionosphere 
fluctuates considerably, both daily and annually. Such fluctuations, 
as we know, affect radio reception. There are ~ 10° electrons and ions 
in a cubic centimetre of the ionosphere. The total number of particles 
in such a volumeis ~10°. Thus, the ionosphere has an ionisation of 
1 per cent. In intense plasmas produced in a variety of ionic devices, 
the degree of ionisation is of the same order of magnitude. 


169. Electric Discharges in a Gas 


Physicists first studied elementary charged particles by passing 
an electric current through a gas (electric discharge in a gas). A glass 
tube filled with gas is incorporated in a circuit and the connections 


446 Streams of Charged Particles (Chap. XXVI 


are made to electrodes sealed in the glass. Electric discharges have 
been studied using various gases, pressures and field intensities. 

Different gases generally behave in the same way. A difference in 
ionisation potential (see p. 499) merely means that certain critical 
points, to be discussed below, exist at other voltages and pressures. 

Let us consider the phenomena, characteristic of every gas, which 
occur when the voltage applied to a gas-discharge tube is increased. 
The phenomena to be described take place over a broad range of pres- 
sures. We shall merely exclude from our discussion such low pressures 
for which the free path of the molecules becomes commensurable 
with the dimensions of the gas-discharge tube and high pressures 
for which the gas density approaches the density of liquids, i.e., 
where the molecules do not have a free path. The reader will soon 
see why the free path of a molecule is of such great importance. 

A low voltage is applied across the gas-discharge tube. If there is 
no ioniser present, current does not flow through the tube. If an ionis- 
er is present, charged particles (ions and electrons) in the gas are 
urged toward the electrodes by the field. This phenomenon may be 
called dependent conduction. The rate at which charged particles 
migrate toward the electrodes depends on many factors, in particu- 
lar, the field intensity and gas pressure. 

A random thermal motion is superimposed on the ordered motion 
of the ions and electrons, which occurs under the action of the con- 
stant electric force. The particles accelerated by the electric field 
travel only a short distance, since they inevitably collide with 
other particles. These collisions are elastic when the velocities are 
low. 

The mean free path of the particles is determined, in the first place, 
by the gas pressure. The higher the pressure, the shorter the free path 
and the lower the mean velocity of the ordered motion. On the other 
hand, the higher the applied voltage, the greater the mean velocity 
of the ordered motion of the particles. 

As indicated in the preceding article, a specific ion concentration 
is established in a gas under the action of an ioniser; in the equilib- 
rium state, the number of newly formed ions per second is equal to 
the number of recombinations during this same interval of time.. 
When a voltage is applied, the equilibrium is disturbed, i.e., some 
of the ions reach the electrodes before recombination can occur. As 
the voltage is increased, a larger and larger proportion of the ions 
created in a unit time reach the electrodes, and the electric current 
through the gas increases. This tendency continues until no time at 
all is left for recombination, i.e., all ions created by the ionisers 
reach theelectrodes. It is clear that a further increase in voltage can- 
not increase the current. The flat portion of the curve in Fig. 192 
represents this saturation current, 
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The lower the gas density, the lower the field intensity which is 
required to achieve current saturation. 

The saturation current is equal to the rate of ion charge forma- 
tion in the gas-discharge tube. 

If the voltage is further increased, new phenomena occur. At a 
certain point, the electron velocity becomes sufficient to dislodge 
electrons from neutral atoms and molecules. For this purpose, the 
voltage across the tube must be 
high enough to enable an electron / 
to accumulate sufficient energy in 
its free path to ionise a molecule. Saturation 

When ionisation occurs as a re- 
sult of impact, the shape of the cur- 
rent-voltage curve is affected, i.e., 
the current begins to rise since a ~# Ee 
voltage increase produces an in- 7 v 
crease in electron velocity. This Fig. 192 
higher velocity makes it easier to 
ionise particles. A large number of ion pairs are created and the 
current strength increases. 

In this voltage range, the passage of current through a gas is accom- 
panied by optical phenomena, i.e., the gas glows: If particle colli- 
sions can result in the ionisation of atoms and molecules, all the more 
so can they result in the excitation of particles, i.e., transitions to 
a higher energy level. In returning to its normal state, an atom or 
molecule radiates a quantum of light. We shall not go into this sub- 
ject here, since radiation by excited atoms and molecules will be 
discussed on numefous occasions below (see Chapters XXVIII and 
XXIX). 

If the electron energy is several times greater than the energy re- 
quired to ionise a molecule, the passage of electric current through a 
gas is of a distinctly avalanche nature. When such an electron strikes 
an atom, an ion and an electron are created. But this new electron 
is also able to ionise particles. Moreover, the primary electron still 
possesses sufficient energy to ionise other atoms. The process is 
cumulative—an avalanche of electric charge, instead of just primary 
ionisation, moves toward the electrodes. In each successive layer, the 
number of ion pairs is greater than in the preceding one. When the 
voltage is fairly high, this avalanche grows extremely rapidly. 

The secondary ionisers in the gas are electrons, not ions. The lat- 
ter are able to ionise gas molecules only when the velocities are very 
high, i.e., greater than those usually prevailing. If the ions do not 
produce ionisation, the removal of the external ioniser stops the dis- 
charge even when the number of ion pairs due to impacts is hundreds 
ot thousands of times larger than in the primary ionisation. Every 
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avalanche is initiated by a primary electron, and since the electrons 
migrate toward the anode, the discharge ceases in the absence of an 
external ioniser as soon as all the electrons arrive at the anode. 

Such highly dependent discharges possess the following property: 
for a given voltage, the strength of the electric current passing 
through the gas is proportional to the number of primary ions created 
per unit time by the external ioniser. The ratio of the gas-amplified 
current to the saturation current created by the primary ionisation 
may reach a value of several thousand. This property of a discharge 
is utilised in ionisation measuring devices—proportional amplifiers 
(see p. 528). 

An electric discharge may become self- -perpetuating, i.e., con- 
tinue after the external ioniser is removed, only if the ions also be- 
come creators of charged particles. This always occurs when the volt- 
age is very high, i.e.—as indicated above—when the ions can ionise 
upon impact with gas molecules. In such a case, the ions will creaie 
more and more new electrons—the primary sources of avalanches. 

However, a self-perpetuating discharge may also occur at consid- 
erably lower voltages if the cathode of the gas-discharge tube takes 
the form of a plate. This is because ions are capable of dislodging 
electrons from a cold cathode. If the ion velocity is sufficient for such 
a process, the condition for a self-perpetuating discharge may be for- 
mulated as follows: new electrons appearing at the cathode must, at 
the very least, replace the work of the primary ioniser. 

Thus far we have said nothing about the role of pressure. At high 
pressures, the discharge column is compressed and thermal ionisa- 
tion begins. When the pressure changes, the current density distri- 
bution changes and so does the luminous nature of the gas discharge. 
At normal and higher pressures, various kinds of discharges are en- 
countered, e.g., silent, arc and spark discharges. In the rare gases, 
glow discharges occur. 

A discharge is said to be silent when a leakage of charge from a con- 
denser or other charged body is unaccompanied by sound or luminos- 
ity. Self-perpetuating silent discharges—brush discharge and co- 
rona—may occur on spikes, thin conductors and, in general, wher- 
ever sharp drops in potential, and, therefore, large field intensities, 
exist. 

At higher voltages, spark discharges occur, i.e., the gas breaks 
down. The breakdown voltage depends almost exclusively on the gas 
pressure in the region between the electrodes. At normal pressure, 
the air between spherical electrodes breaks down when the field 
intensity is 30 kv/cm. Measurement of the breakdown distance may 
serve as a measure of high voltages. 

An electric arc is a special type of discharge. The current density 
in such a discharge is large even though the voltage between ile 
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electrodes is low. The distinctive feature of an arc discharge, usually 
created between carbon electrodes, is the extraordinary high tempera- 
ture attained by the electrodes. Therefore, thermionic emission from 
the cathode plays a large role in 
an arc. 

In the rare gases, a glow dis- 
charge has a characteristic form 
for every pressure. The degree of 
rarefication may be determined 
experimentally with great accur- 
acy by mere observation of the 
discharge form. Various types of 
gas discharge forms are shown 
in Fig. 193. 


170. Beams of Charged Particles 


In a gas-discharge tube, an 
electron stream moves in the 
opposite direction to a stream 
of positive ions. To obtain an 
ion ray, i.e., a beam of ions 
moving in one direction, a hole 
or canal is made in the cathode. 
A large proportion of the ions 





entering this aperture passes ~ p= 002 mm of Hg 
through it and then continues : 
to move by inertia. Such beams, Fig. 193 


called canal or positive rays, 

were known to physicists as far pack as the last century. A similar 
method of obtaining an ion stream is used even today. First, a sub- 
stance is transformed into the gaseous state. Then, its molecules are 
ionised and the positive ions removed from the gas-discharge region 
through a cathode canal. 

A gas discharge is not.used to create an electron beam. A so-called 
electron gun serves as an electron beam source. This is a device based 
on the phenomenon af thermionic emission (see p. 690). Heated metals, 
as is well known, may serve as electron sources. Thus, 1 cm? of tung- 
sten surface heated to 2,400° yields in one second a number of elec- 
trons corresponding to a current strength of 1 ampere. 

Fig. 194 is a diagrammatic representation of an electron gun. 
To accelerate the electrons, a voltage is applied across the electrodes. 
A tungsten filament (/) heated by an electric current serves as the 
cathode. The anode (2) has the shape of a glass with a round hole in 
the bottom. Electrons emerge from this aperture, which determines 
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the divergence and width of the beam. The focussing electrode (3) 
makes it possible to obtain beams of electrons which are fine and in- 
tense (see Sec. 171). 

The problem of obtaining an electron beam of maximum intensity 
for a given expenditure of energy is of great engineering importance. 

To utilise all the electrons emitted by the filament, one must, 
in the first place, accelerate the electrons with a sufficiently high 
voltage. The filament emits a certain 
number of electrons per unit time. 
All these electrons must be drawn 
away from the filament. If the voltage 
is low, an electron cloud which im- 
pedes emission is formed near the fila- 
ment. As the voltage is increased, the 
cloud is gradually dissipated and the 
thermionic current increases. Finally, 
we reach a voltage with which no 
electron cloud is formed. A further 
increase in voltage does not result in 
an increase in thermionic current since 
saturation has been reached. This 
is the condition required for electron 
gun operation. Thus, a sufficiently 

Fi high voltage ensures that all of the 
g. 194 
electrons are drawn away from the 
filament. 

The next problem is to obtain increased electron emission from a 
filament. The emission from thorium and oxide cathodes is many 
times greater than that from tungsten cathodes. A thorium cathode 
consists of a tungsten wire coated with a very thin layer of metallic 
thorium. Thoriated tungsten yields the same current at 1,500° as 
pure tungsten does at 2,400°. An oxide cathode consists of a metal- 
lic base coated with a layer of an oxide of an alkaline earth metal. 
Such a cathode yields the same current at 900° as tungsten does at 
2,400°. In modern electronic devices, oxide cathodes are heated indi- 
rectly. The cathode is manufactured in the form of a tube in which 
there is placed a tungsten spiral heated by an electric current. 





171. Electron Lenses 


An electron beam may be controlled by electric and magnetic 
fields. The action of such fields is not restricted to the deflection of 
a beam from its original direction. Thus, a parallel beam of electrons 
may be made to converge or diverge, and a beam diverging at one 
point may be made to converge at another. A “lens” for an electron 
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gun is produced by a very simple system of fields. A very important 
branch of science known as electron optics, the most significant 
achievement of which is the electron microscope, has developed on the 
basis of this principle. 

Let us recall the properties of an ordinary double convex lens. 
If an object is placed on one side of such a lens, the image of the ob- 
ject on the other side will be magnified or reduced in size, This is 





Fig. 195 


because all rays emerging from an object point gather at an image 
point and, moreover, all image points are located in a single plane 
perpendicular to the axis of symmetry of the lens. The simple geomet- 
ric construction of Fig. 195 shows why the lens operates in such a 
manner: the angle of deflection of a ray which impinges on a lens is 
proportional to the distance A between the axis of symmetry and the 
point of intersection of the ray and the lens. The construction has 
been made for an object point lying on the axis of symmetry, but 
the results are similar for other points as well. It should be stipulated 
(as is done in optics) that the discussion is valid if the lens is thin 
and the beam divergent within the limits of a small solid angle. 

We shall now show that electric and magnetic fields having axial 
symmetry may serve as lenses. Such fields may be obtained by means 
of the following: electrically charged plates with a round aperture 
in one of them, cylindrical condensers, loops of current and flat coils. 
There are a large number of systems which may serve as lenses for 
electron rays. However, an example of an electrostatic lens and an 
example of a magnetostatic lens will suffice to clarify the principle. 

Let us consider a condenser in which a round aperture has been 
made in one of the plates (see Fig. 196). If an electron beam impinges 
on this aperture from the side of uniform field, the beam will be fo- 
cussed: When an electron reaches the region of nonuniform field, 
a force perpendicular to the equipotential surfaces, and therefore at 
an incline to the axis of symmetry, acts on it. Resolving this force 
into two components, we see that there is a radial component urging 
the electrons toward the axis. But this does not suffice for the system 
to act as a lens. It will also be necessary for the radial component of 
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the field to be proportional to the distance between the axis of sym- 
metry and the point at which the electron reaches the plane of the 
aperture. It may be ae shown that this is indeed the case. The 





| 
u=30 voll — 
Fig. 196 


radial component of the electric field intensity may be expressed in 
the form 


ld 
Bae a r, 


where is the field intensity gradient along the axis of symmetry. 


To prove this, consider a small 
cylinder oriented as shown in 
Fig. 197, where the distance 
1-2 is infinitely small. Since 
there is no charge inside the 
? cylinder, mr°dE, the flux differ- 
esas ence between the ends / and 2, 

must be equal to —E,2nrdz, 

i.e., the flux through the lat- 

Fig. 197 eral surface with the reverse 





sign. 
Thus, an aperture in a charged electric plate serves as a lens for an 
electron beam. 
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Now, let us consider the behaviour of electron rays passing through 
a flat current-carrying coil (see Fig. 198). Such a coil constitutes a mag- 
netostatic lens. The electrons move in aspiral and return to the axis 
of symmetry after completing one turn of a helix. The focussing prop- 
erties of the coil are evident. It may be shown that the deflection 
angle of a ray is proportional to the distance of this ray from the axis 
of symmetry. The magnetic 
coil changes the azimuth of 
the electron trajectory, i.e., 
in such a lens the image of an 
object is turned. But this an- 
gular displacement does not 
distort the  electron-optical 
image. 

Thus, for an object scattering 
or radiating electron rays, an 
“electron image” of the object 
may be obtained if an electro- 
static or magnetostatic lens is 
placed in the path of the scat- 
tered electrons. When a photo- 
graphic plate or luminous screen is placed in the plane of the image, 
a peculiar “picture” of the object is obtained. It is bright at points 
corresponding to the radiation or scattering of many electrons and 
dark at points corresponding to the absence of radiation or scatter- 
ing in the object. Since a system of electron lenses yielding a magni- 
fied image of an object can be constructed, it is possible to construct 
an electron microscope. : 





Fig. 198 


172. The Electron Microscope 


The electron microscope, i.e., a microscope in which the role of a 
light ray is played by a beam of electrons, provides exceptional 
opportunities, not yet fully utilised, to “observe” objects directly. 
This is because the possibilities of magnifying an object are, general- 
ly speaking, unlimited in an electron microscope. On the other hand, 
an optical microscope provides a magnification of not more than 
2,000-3,000. 

To understand the reasons for this difference, we must familiarises 
ourselves with the resolving power of a microscope. The question 
arises: What are the conditions for seeing two close points separately? 

Imagine that an ideal point source of light is located in front of 
a slit or round aperture. When light*passes through the aperture, a 
diffraction pattern is obtained. A lens placed behind the aperture 
does not concentrate the rays in a point. On the contrary, a blurred 


. 454 Streams of Charged Particles [Chap. XXVI 


circle (or band, in the case of a slit) surrounded by alternately bright 
and dark rings appears. On p. 365, the magnitude of this blur for a 
slit was calculated. The radius of the disk, to which a point diffracted 
from a circular aperture corresponds, was given on p. 366. It is equal 


to 1.22 ae 


Every optical instrument must have an aperture of entry—the 
objective. Difiraction at the objective is inevitable, and any lumi- 
nous point in the focal plane of theinstrument is diffused into a lumi- 
nous circle. The angular dimension of the radial blur is equal to 


1.29% . Therefore, its linear dimensions in the focal plane are equal 


to 1.2048 . Here, f and D denote the focal distance and the diameter 


of the objective, respectively. In the case of a microscope, this formu- 
la gives merely the order of magnitude since the object is close to the 
objective and, as a result, the beam of rays cannot be considered 
parallel. But since we are only interested in the qualitative picture, 
we shall not go into the fine points. 

If two luminous points observed in a microscope are so close that 
the centres of their luminous image fields are closer to each other than 
a distance equal to the field radius, these two points cannot be dis- 
tinguished as separate points. 

The limit of linear resolution in a microscope is equal to 1.294h Z 


Since the ratio of the focal distance to the objective diameter cannot 
be made significantly less than unity, a microscope enables us to ob- 
serve two points separated by a distance of the order of a wavelength. 
Thus, when viewing in ordinary light (wavelength of the order of 
0.5 w), we cannot detect object details smaller than a hundredth of 
a micron. 

What is the magnitude of useful magnification which may be ob- 
tained with an optical microscope? Imagine that a picture is viewed 
through an ocular, is photographed, then the latter photograph is 
viewed through an ocular, etc. It is evident that in this manner any - 
desired magnification can be achieved. However, further magnifica- 
tion loses all meaning when it is seen with the naked eye that the res- 
olution limit of points of a photograph has been reached. Thus, if 
a photograph obtained with an optical microscope is magnified so 
that 0.5-1 mm corresponds to one micron, the limit of useful magnif- 
ication has been reached. Hence, the useful magnification of such a 
microscope is about 1-2 thousand. 

As will be shown in the next chapter, an electron ray has the prop- 
erties of a wave of wavelength : 

A= — > 


mu 
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where f is Planck’s constant, m is the mass of an electron and v is 
its velocity. When the voltage is equal to 50,000 volts, the wavelength 
equals 0.05 A. But the distance between atoms is greater than 1 A. 
Hence, the usefulness of an electron micro- 
scope is not limited by its resolving power. 

Calculations indicate that the resolution 
limit of an electron microscope is 2-3 A. 
At present, it is possible to achieve a reso- 
lution of 5-6 A, i.e., a useful magnifi- 
cation of a million. 

It turns out that there is much in com- 
mon between light optics and electron 
optics. In electron-optical instruments, we 
find the same elements and the same prin- 
ciples of construction encountered in ordi- 
nary optical instruments. The main differ- 
ence (and this is not of a basic nature) is 
that the “index of refraction” of an electron- 
optical lens varies continuously, since the 
electric and magnetic fields vary continu- 
ously, while that of an optical lens varies 
abruptly (at its boundary). Fig. 199 is a 
diagrammatic representation of an electron 
microscope: /) electron projector, 2) con- 
denser lens, 3) object, 4) objective, 5) inter- 
mediate image, 6) projection lens, 7) final 
image, 8) observation: window. If we wish 
to examine an image directly, a fluorescent 
screen may be used instead of a photo- 
graphic plate. An electron microscope is 
much larger than an optical microscope, 
requires a source of electric voltage and 
costs considerably more. But this is com- 
pensated for by its tremendous resolving 
power. 

The electron microscope portrayed in 
the diagram employs magnetostatic lenses. A 
high vacuum, of the order of 10-* mm of Hg, is created in the systems 
in order to prevent electrons from colliding with air molecules. An 
electron gun produces a beam of electrons with an energy corre- 
sponding to 50,000 volts. Therefore, the installation must include a 
high-voltage transformer to boost the line voltage to the indicat- 
ed value. 

Different methods of observing an object by means of an electron 
beam exist. Since matter is a very strong absorber of electrons, 





456 Streams of Charged Particles [Chap. XXVI 


its thickness must be no greater than a fraction of a micron if we wish 
to observe an object in the “window”. When electrons pass through 
a thin layer of a substance, they arescattered differently by differ- 
ent portions of it. Fig. 200 illustrates the two methods used for elec- 
tron vision. Only those electron rays transmitted through the sub- 
stance without scattering are allowed to pass, while the scattered rays 





ees lies, 
a ) b ) 


Fig. 200 


are blocked by a diaphragm (Fig. 200a). In such a case, the brightest 
parts of the image correspond to those portions of the substance which 
do not scatter electrons, including those portions where the layer of 
substance is particularly thin. On the other hand, the parts of the 
object which scatter electron rays in profusion are dark, The second 
method is the reverse of the first (Fig. 2005). The object is placed at 
an angle to the axis of the microscope, so that only scattered electrons 
are directed through the lenses. It is evident that the roles of bright 
and dark fields in the image are now reversed. 

The examination of objects in an electron microscope is usually 
performed on a base the thickness of which is about 0.01 w. Such 
a base is made in the following manner. A drop of a solution of 
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collodion in amyl acetate is placed on the surface of water. The drop 
spreads on the surface forming a thin film which becomes quite firm 
after the amyl acetate evaporates. A loop made of thin wire is placed 
under the film. The object holder is now ready. This base will appear 
bright for normal incidence 
of the beam. It will appear 
dark when the beam im- 
pinges at an angle. 

If the objects under in- 
vestigation are poor scat- 
terers of electrons, they 
will not be seen very well 
against the common back- 
ground. The objects are 
sprayed with -a metal to 
obtain more contrast. The 
base with the mounted 
object is placed in the Fig. 201 
path of a stream of metal 
atoms produced by vaporising a metal in a vacuum. The spray is 
directed at an angle tothe base, and the sample becomes shaded as: 
shown in Fig. 201. When an object is examined with electron rays 
an exceedingly bright picture is obtained since electrons are scat- 
tered only from the parts of 
the object sprayed with metal 
atoms. Fig. 202 shows how 
flu viruses look under an elec- 
tron microscope. 

The examination of objects 
on a base is particularly iin- 
portant in biology and medi- 
cine. Bacteria are scooped up 
with the sample holder from 
the medium in which their 
presence is suspected. It is 
easy to study particles obtain- 
a in a suspended state since they can be scooped up with a 

older. 

Entirely different methods are used in examining the surface of a, 
solid. Under certain circumstances, a solid may be made to emit elec- 
trons. By passing the resulting beam of electrons through lenses, 
we are able to see thesurface. However, this procedure cannot always 
be used: when there is low emission, when the sample cannot be heat- 
ed, etc. Under such circumstances, the replica method is used. In 
this method, an object is coated with a thin layer of substance, whic: 
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point of which has a very small radius of curvature. It is possible 
to create near a cathode having this shape a field of the order of 
10’ volts/em. For such a field, electrons are torn away from a cold 
cathode in a radial stream. If an obstacle is located in the path of 
the stream, a dark image appears on the fluorescent screen (or pho- 
tographic plate). If an object lies on the surface of the point, the mag- 
nification is equal to the ratio of the distance between the point and 
the screen to the radius of curvature of the point. Using special means, 
the radius of curvature may be made less than 200 A. 
__ If molecules of a substance are placed on the point, their images 
appear on the screen. This has been done with phthalocyanide mole- 
cules, the dimensions of which are about 15 A. The form of the 
molecule, its characteristic four-petalled structure, and the concen- 
tration and rarefaction of the electron density were clearly visible 
on the screen. 

_ Although this method can certainly not be used for all objects un- 
der normal laboratory conditions, the possibilities of a method yield- 
ing a useful magnification of more-than one million should not be 
underestimated. 

But the resolving power may be increased by yet another order of 
magnitude and, moreover, the clarity of the image may be consider- 
ably improved. This may be accomplished by using an ion beam 
instead of an electron beam, but otherwise employing the same prin- 
ciple of object examination. ‘An ion projector does not differ in prin- 
ciple from an electron projector. The point is given a positive poten- 
tial and when the field is large (10° volts/cm) ions may be torn away. 
For this purpose, it is necessary that atoms or molecules be adsorbed 
by the surface of the point. either beforehand or during operation of 
the projector. In the instrument shown in Fig. 204, a small quantity 
of hydrogen molecules is introduced into the vessel by means of a 
palladic tube. As soon as neutral atoms (or molecules) settle on the 
surface of the point they give up an electron and then, as positive 
ions, move toward the screen. 

7 By means of such an ion projector, it has been se to obtain 
the image of a tungsten point itself. An image arises owing to the 
fact that adsorption of atoms occurs in specific parts of a tungsten 
bid Se ae 7 the obtained image, it was possible to discern the lattice 
FO, , the PeROHstION attained equalled 2-3 A, 


174. The Electron-Beam Tube 


An electron-beam tube is a widely used device, being an essential 
component of a television set, radar system and oscilloscope. The 
principle of operation of such a tube may be explained by means of 
the simplified diagram shown in Fig. 205. We see in the figure an 
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electron gun (/) and two condensers (2) for deflecting the electron 
beam in two mutually perpendicular directions. 

Let us consider the application of an electron-beam tube to the 
recording of rapid processes. If no voltage is applied to the deflecting 
plates, the electron beam is directed along the axis of the device and 
aspot appears on the luminescent 
screen. Assume that an alternat- 
ing voltage having a frequency 
greater than 20 cps is applied to 
the horizontal deflecting plates. 
Then, the electron beam _be- 
gins to oscillate in the vertical 
direction in synchronism with 
the varying field. Since electrons 
have very little mass, these os- 
cillations will have practically no inertia.* The motion of the beam 
is not perceived because the luminous spot moves too rapidly for 
the human eye to follow; moreover, the screen has an afterglow. 

Now, let us consider the second pair of plates, which provide the 
so-called “sweep”. A saw-tooth voltage is applied across these plates. 
Since this second pair of plates deflects the beam horizontally, the 
luminous spot moves, say, from left to right quite uniformly under 
the action of such a voltage. When the edge of the screen is reached, 
the luminous spot rapidly returns to its initial position and the proc- 
ess begins anew. By changing the frequency of the saw-tooth voltage 
_within a broad range of frequencies, we can vary the time scale of 
the horizontal sweep accordingly. 

If a sweep voltage is applied to the horizontal deflecting plates 
and the voltage being investigated is applied to the vertical deflect- 
ing plates, a curve of voltage vs. time will be obtained on the screen 
since the horizcntal coordinate of the luminous spot is proportional 
to the time reckoned from an arbitrary instant. 

An oscilloscope is particularly useful in the Tvesieslion of period- 
ic processes. It is always possible to select the sweep in such a man- 
ner that the curve described by the beam during one run from left 
to right coincides with the curve described during the second and suc- 
cessive runs. When the sweep period is fixed, we obtain a stationary 
curve of voltage as a function of time in a given time interval (from 
a fraction of a period to several periods). 

A saw-toothed voltage is produced by a self-sustained oscillatory 
process analogous to that described on p. 99 (the toppling of a tub of 





Fig. 205 


* This absence of inertia is determined by the axial velocity of the elec- 
trons. Therefore, to record very rapid processes, high voltage oscilloscopes are 
used. 
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water). The movement of the beam from left toright is produced by 
continuously and uniformly charging a condenser.* A discharge tube 
is connected to the terminals of the condenser. As long as the poten- 
tial difference across the tube is less than the ignition potential, the 
presence of the tube does not affect the charge on the condenser. 
When the potential reaches a critical value, the condenser rapidly 
discharges and the process begins anew. The saw-toothed oscillations 
must be synchronised with the periodic process under investigation. 

An electron-beam tube becomes more complex when a modulator 
is placed between the cathode and the anode. Such a modulator con- 
sists of a metallic cylinder, one end of which is covered with a dia- 
phragm containing an aperture equal to the sizeof the cathode. A nega- 
tive potential applied to the modulator makes it possible to control 
the intensity of the beam. At a certain value of voltage (the blanking 
voltage), the beam is completely cut off. Such blanking is necessary, 
for example, during the return trace of the beam. Thus, by means of 
the modulator, the trace of the beam is blanked during the return 
sweep. 

Two variable quantities may be viewed simultaneously on a screen 
if an electron-beam tube is equipped with an electron switch that al- 
ternately connects the deflection mechanism in one measuring cir- 
cuit and then in another. Double-beam oscilloscopes have been de- 
veloped for this same purpose. Such an instrument is equipped with, 
an electron-beam tube having two independent electron projectors 
and deflecting systems. A double-beam oscilloscope has in addition 
two separate amplifiers for the voltages being investigated and two 
saw-toothed voltage generators. 

The choice of a proper luminescent screen for an electron-beam tube 
is of great importance. For certain purposes, long-persistent screens 
are desirable, while for others it is required that the luminosity 
disappear as soon as the beam is switched off. 

Single-pulse processes may be recorded with an electronic oscil- 
loscope if it is equipped with an auxiliary camera the shutter of which 
is synchronised with the sweep. This makes it possible to photograph 
the screen at the required instant. 


175. Mass Spectrograph 


The fundamental equation of motion of a charged particle, 


ar 


m Tee (E +7 (0B), 








* A condenser charges and discharges exponentially, but by using a small 
portion of the exponential curve these processes may be made quite linear. 
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shows that the path of a charged particle is determined by— , the 


ratio of the charge of the particle to its mass. Therefore, measure- 
ments of the deflection of a charged particle in an electric and a mag- 


netic field may beused to find <= . Since the initial velocity of a par- 


ticle is not known, — cannot be determined by measuring the deflec- 


tion in either an electric or magnetic field alone. The general formu- 
las for deflection in electric and magnetic fields (Sec. 167) show that 


the path is determined by coefficients containing — and the initial 





B-direction away from us 


Stream of. 
positive 
charge 


Fig. 208 


velocity. The problem is solved by measuring the deflection of one 
and the same particle in an electric and a magnetic field. 

In the simplest case, it is sufficient to balance the electric and mag- 
netic deflections. For this purpose, the fields should be oriented as 
shown in Fig. 206. Charged particles will not be deflected when the 


following condition is satisfied: eE= + eH. This experiment enables 


us to determine the velocity of a particle. Now, it is merely necessary 
to measure the deflection produced by the electric field or by the mag- 
netic field alone. Knowing the magnitude of the detesuign of a par- 


ticle from its rectilinear path, we may calculate = — 
Measurements of — are of great importance in iets physics 
as a means of determining the mass of a particle when the charge is 


known. This pertains particularly to the determination of the mass 


of ions. 
An instrument in which the particles of a beam may be sorted ac- 
cording to mass, and the composition of the beam according to mass 
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may be investigated, is called a mass spectrograph. The mass spectro- 
graph proposed by Aston is represented schematically in Fig. 207. 
Its principle of operation may be explained as follows. Particles of 
various velocities are introduced into the electric field of a condenser. 


Consider a group of such particles having the same — ratio. Upon 


entering the electric field, a stream of these particles will be divided 
_ Since fast particles are deflected less than slow ones in an electric 





Fig. 207 


field. Now, this spread of particles is introduced into a magnetic 
field (perpendicular to the page). The sense of the field is such that 
the direction of deflection of the particles is opposite to that in the 
electric field. Here, too, fast particles are deflected less than slow 
ones. Hence, it follows that at some point beyond the field the divid- 
ed beam of particles will again gather at a point, i.e., become fo- 
cussed. : 

Particles having a different — ratio will also gather at a point, 


but not at the same one. Calculations indicate that the foci of all 


<— lie approximately on a straight line. If a photographic plate is 


placed along this line, each group of particles will be represented by 
a separate line. 

If a mass spectrograph is constructed with great accuracy, its re- 
solving power will be extremely high and it can be employed to detect 
the presence of very close isotopes. At first glance, such precision may 
appear unessential since it may be reasoned that the masses of iso- 
topes differ by at least one atomic weight unit. But while it is true 
that isotopes of one and the same chemical element differ by an atom- 
ic weight unit, isotopes of different elements (e.g., S** and A*’) 
may difier very little in mass. Moreover, it is important tobe able 
to determine the mass of complex ions. Such problems arise, for 
example, in connection with the chemical analysis of gas mix- 
tures. Different particles may then turn out to be close in mass, e.g., 
CH, and N™* or N"*H, and O'°. All such problems may be solved 
by means of a mass spectrograph. 
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176. Accelerators of Charged Particles 


Actually, all such devices as electron tubes, X-ray tubes and elec- 
tron guns are accelerators of charged particles, but this term generally 
denotes installations producing streams of charged particles (elec- 
trons, protons, deuterons, etc.) moving with velocities close to the ve- 
locity of light. Such streams of particles are then allowed to impinge 
on matter. The interaction achieved may be used for various pur: 
poses: investigation of nuclear transformations, production of radio- 
active isotopes, medical purposes, chemical action, etc. The role of 
accelerators in modern science is an extremely important one. 

It is possible, of course, to accelerate a particle to any desired ener- 
gy by making it pass through successive accelerating fields. How- 
ever, such a brute force approach, i.e., the construction of a linear 
accelerator, is not always practical. To accelerate particles to ener- 
gies of only tens of thousands of electron-volts requires a path length 
of the order of many centimetres. But modern physics is striving to 
obtain streams of particles with energies of tens of billions of elcc- 
tron-volts. Such a linear accelerator would have a length of tens of 
kilometres. Practically, therefore, such a solution to the problem is 
unacceptable. 

Lawrence was the founder of high-energy accelerators. His basic 
idea is that in a single installation particles should be accelerated by 

an electric field and repeated- 

Target ly made to return to the same 

accelerating gap by means of 

a magnetic field. The first ac- 

Source of celerators operating on this 

charged principle became known as ° 

particles cyclotrons. 

A cyclotron is represented 

diagrammatically in Fig. 208. 

The accelerating chamber may 

be pictured as a flat circular 

pill box cut along a diameter. 

Radio -frequency It is of large dimensions, 

Ee HEGE made of metal, and its two 

halves—the basis of the accel- 

erating chamber—areknown as 

Dees. An alternating electric 

field of period T is applied to the Dees. This entire system is placed 

between the poles of an electromagnet, which creates a strong con- 
stant magnetic field inside the box perpendicular to its base. 

The magnetic field intensity is determined by the period of the elec- 
tric voltage. That is, the value of the field intensity must be such 
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that the period of revolution in this field (expressed by the formula 
T= aie ) is equal to the period of the electric field. When this con- 
dition is satisfied, charged particles entering the accelerating 
chamber are captured by the fields and accelerated spirally with con- 
stant period 7. Thus, a particle in the gap between the Dees is accel- 
erated, traverses half a circle in the magnetic field and arrives at 
the accelerating gap just when the voltage phase changes by 180°. 
Thereupon, the particle is accelerated in the same direction and en- 
ters the other Dee where it traverses half a circle of larger radius. 
Each time the particle passes through the gap its velocity increases. 
The particle must be deflected from its circular path to eject it from 
the cyclotron. 

A cyclotron has limited usefulness. As the velocity of a particle 
increases its mass increases and, hence, its period of revolution in 
the magnetic field also increases. As a result, the particle begins to 
lag, i.e., it arrives at the accelerating gap when the phase of the volt- 
age has changed by more than 180°. This lag increases until, finally, 
the electric field not only does not accelerate particles, but retards 
them. Calculations indicate that the maximum energy that a cyclo- 
tron can transmit to a charged particle is given by the formula 

D) eUyinc® 
wt 








. For protons this amounts to 22 million electron-volts 


(22 Mev), and for a-particles about three times more. New means 
are necessary to attain higher energies. 


177. Auto-Phasing 


Veksler in the Soviet Union and McMillan in the United States 
elaborated a new concept in the accelerator field in 1944 and 1945, 
respectively. This concept may be summarised as follows: Examina- 
tion of the formula for the period of revolution of a particle shows 
that increased mass may be compensated for by an increase in mag- 
netic field. If such compensation takes place, the period of revolution 
of a charge will remain constant. 

Let us assume that the magnetic field intensity increases during 
the operation of the cyclotron. Among the thousands of millions of 
particles moving in the accelerating chamber, there are undoubtedly 
some particles whose increase in mass due to increased velocity is 
exactly proportional to the increase in magnetic field. Hence, the pe- 
riod of revolution of such particles will remain unchanged. Calcu- 
lations indicate that under such conditions other particles will also 
not fall out of synchronism. The only difference, which is quite un- 
important in practice, is that the energy of these particles will not 
increase in a monotone manner, like the energy of the “lucky” 
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particles whose increase in mass is completely compensated for by 
the increase in magnetic field, but will fluctuate about the energy 
value of the “lucky” particles. 

The orbit radius of the “lucky” particles agrees with their energy. 
This is the reason for their “good fortune”. Now, let us consider a 
particle the energy of which is greater than that required for a given 
radius. Such a particle will be retarded owing to an excess of mass 
increment. On the other hand, if a particle has less energy than re- 
quired for a given radius, the mass increment will be insufficient and 
the particle will be accelerated relative to other particles at thesame 
radius. Thus, since the mass of a particle increases with velocity, 
particles can, so to speak, regulate their own velocity and select volt- 
age phases which serve to correct their motion. That is why this 
phenomenon is referred to as “auto-phasing”. 

It transpires, therefore, that it is possible, in principle, to increase 
the velocity of particles in a cyclotron without limit if the magnetic 
field intensity is gradually increased. In such an installation par- 
ticles must be accelerated in pulses. When the field increases particles 
are accelerated, but the reverse cycle is idle. 

The above method is not the only way to achieve auto-phasing. 
Another approach is to slowly vary the period of the electric voltage. 
The principle is basically the same: an increase in the mass of a 
charged particle results in an increase in the period of revolution in 
the magnetic field; in this case, the regime of the alternating electric 
field is varied so as to compensate for this increase. An accelerator 
in which the period of the electric voltage is slowly increased is known 
as a synchrocyclotron. The path of a particle in a synchrocyclotron is 
a flat spiral. The farther this spiral extends, the greater the energy 
of the particle. Thus, an increase in energy is associated with an in- 
crease in the area of the accelerating chamber in the magnetic field. 
The most powerful accelerator of this type is the synchrocyclotron of 
the U.S.S.R. Academy of Sciences. This accelerator yields a beam of 
680 Mev protons. Its magnet weighs 7,000 tons. The energy limit of 
a synchrocyclotron is of the order of hundreds of Mev since a further 
increase in energy would result in an unthinkable increase in magnet 
weight. Particle energies of thousands of millions of electron-volts 
(Bev) are attained by means of proton synchrotrons. 


178. Proton and Electron Synchrotrons 


Proton synchrotrons are basically different from cyclotrons. Since 
a proton synchrotron accelerates particles in a single circular orbit, 
the volume of the magnetic field may be greatly reduced. The entire 
central region of the magnet is, so to speak, cut out. As a result 
much less steel is required for the magnet. Thus, the electromagnet 
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of the 10-Bev proton synchrotron of the U.S.S.R. Academy of Sci- 
ences weighs 36,000 tons, but the electromagnet of a 10-Bev syn- 
chrocyclotron would weigh about 1 million tons. 

To accelerate particles in an orbit of constant radius, one must 
vary the period of the accelerating electric field and the intensity of 
the magnetic field in avery definite manner. In such an installation, 
particles are accelerated in pulses. Each pulse is obtained by increas- 
ing the magnetic field and decreasing the period of the accelerating 
electric voltage in a prescribed manner. 

For a given orbit radius, a unique relationship exists between the 
field intensity and the velocity: 


- Myd ae 
as V1 a 
~ oe 


and the relationship between the period of revolution and the veloci- 
ty is determined by the expression 


T= 2Qur : 
v 





If these conditions are satisfied, a “lucky” particle will be accelerated 
in a monotone manner. 

Since the auto-phasing conditions occur here too, the remaining 
particles follow a path which oscillates about the circular orbit and 
they also take part in the synchronous increase in velocity. Since 
particles oscillate about an average circular orbit, it is necessary 
to make the width of the pathway for the charged particles rather 
broad. Methods are being sought which would allow us to reduce the 
width of this pathway. Success would lead to a reduction in the amount 
of steel required for a given accelerator and make it practical to 
construct accelerators with even higher particle energies. 

Thus far we have been discussing accelerators of heavy particles. 
Electron accelerators have a number of special features. 

As far back as 1941, an accelerator known as a betatron was con- 
structed to accelerate electrons. Such an installation operates on the 
principle of a transformer. The winding of a magnet constitutes the 
primary winding and a beam of electrons revolving in a circle of con- 
stant radius constitutes the secondary “winding”. In other words, the 
electrons move along a circular line of force of the rotational electric 
field produced by an alternating magnetic flux. 

At first glance, it appears that such acceleration may continue 
without limit. The increase in mass with velocity does not limit the 
acceleration since there is no need for synchronism in this phenome- 
non. Nevertheless, a betatron does havea limit. At energies of sever- 
al hundred Mev, energy losses due to radiation become considerable 
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—an accelerated electron radiates electromagnetic waves. As a result 
of this radiation, the electron path is transformed from a circle to an 
inward spiral and acceleration becomes impossible. Betatrons are 
useful when electrons with energies from 20 to 50 Mev are required. 
Ii greater energies are desired, electron synchrotrons must be used. 
Such accelerators were first proposed in 1946-47. 

The electron synchrotron is similar to the synchrocyclotron de- 
scribed above, i.e., it isaresonance accelerator. An accelerating elec- 
tric field is added to the magnetic field of the betatron. The accelerat- 
ing mechanism is maintained by auto-phasing. But the fact that we 
are dealing with lighter particles, namely, electrons, simplifies the 
construction problem. This is because at energies of only 2 to 3 Mev 
the electron velocity is almost equal to the velocity of light. Hence, 
when the energy increases further, the radius of the electron path does 
not change. This makes it possible to construct the magnet in the 
form of a ring as in thecaseof the proton synchrotron. The radiation 
losses of the electron synchrotron are compensated for by increasing 
the accelerating voltage. 

At high-energy levels, radiation losses reach imposing values. In 
4 300-Mev accelerator having an orbit radius of 1 metre, an electron 
radiates 1,000 ev per revolution. In a 10-Bev electron synchrotron 
having an orbit radius of 20 metres, the energy losses per revolution 
would be equal to 30 Mev. 
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similar to an electron microscope. In fact, one may use an electron microscope 
to obtain diffraction electronograms. For this purpose, it is merely necessary 
to remove the lens. 

Electron diffraction is used for the same purposes as X-ray diffraction, but 
electronography has a number of advantages over roentgenography. The main 
advantage is the short exposure time required. Matter scatters electrons much 
more effectively than X-rays. An electronogram may be obtained in a period 
of time measured in seconds, while a roentgenogram requires at least several 
minutes. 

Since electron beams do not penetrate matter to any extent, electronography 
may be conveniently employed to investigate the structure of surfaces. Elec- 
tronography may also be used to study the distribution of atoms in crystals, 
assuming, of course, that the structure is not complex. 


We are not so much interested in the applications of electrono- 
graphy as in electron diffraction itself. It is of great importance to 
determine the wavelength of an electron beam. This may be done ex- 
perimentally. When the accelerating voltage U is varied, the wave- 


length varies. It turns out that A is inversely proportional to VU. 
Thus, when A is expressed in Angstroms and U in volts, 


12.25 
N=. 
VU 


As far back as 1924, before the discovery of electron. diffraction, 
Louis de Broglie advanced a bold hypothesis. He proposed that the 
concept of the dual nature of an electromagnetic field, its manifesta- 
tion in the form of a wave of frequency v and wavelength 1 as well as 
in the form of particles (photons) with an energy E =Av and a momen- 
tum p=v= 4 , Should be broadened to include particles of mat- 
ter. Experiments in electron diffraction confirmed this hypothesis. 
The formula for the electron wavelength 

h A 


pmo 


is easily converted into the form 


* 
jE , where const* = eS ; 
VU V 2em 





by means of the relatio Ot] or v= V Ze. 


2 m 
Substituting the values of e, m and h in the CGS system of units, 
and converting U into volts, we obtain the above value for the con- 
stant. 
By means of the above formula, one can obtain the following electron wave- 
lengths for the indicated values of accelerating potential: 
U (volts) 1 100 1,000 
4 (A) 12.25 1,22 0.39 


Sec. 180] The Fundamental Concepts of Quantum Mechanics 47] 


Since diffraction from a crystal is possible when the wavelength is of the 
same order as the period of the crystal lattice (~1A), it is seen that electrons 
the energies of which are of the order of a hundred electron-volts may be diffract- 
ed from a crystal. 


180. The Fundamental Concepts of Quantum Mechanics 


As we have seen, an electron beam may behave as a wave having 
a wavelength has, Is this behaviour typical of a large group of 


electrons or are wave properties inherent in single electrons? To 
answer this question, let us compare the electronograms obtained 
with a low and a high intensity beam of electrons. In one such exper- 
iment, the beam was of such low intensity that the average time in- 
terval between two successive passages of an electron through the 
diffracting system was 30,000 times greater than the time taken by an 
electron to travel from the filament to the photographic plate. Never- 
theless, the diffraction pattern differs in no way from another elec- 
tronogram obtained with a beam the intensity of which was 10’ times 
greater. Such experiments show that the wave property is inherent 
in single electrons. Thus, what was stated on pp. 426-27 with respect 
to a photon is also valid for an electron. An electron does not behave 
like a projectile or pellet. The behaviour of an electron cannot be 
described by means of Newton’s laws of mechanics. The field devoted 
to the investigation of the behaviour of microparticles is known as 
quantum mechanics. 

This dual behaviour is not peculiar to an electron alone. Wave 
behaviour is characteristic of all microparticles. Thus, it is possible, 
for example, to observe neutron diffraction. “Diffraction of helium 
atoms and hydrogen molecules have also been observed. As will be 
seen below, the greater the mass of a particle the more do its wave 
properties recede into the background. But more about this later. 
Let us designate by p the amplitude of the wave associated with a mi- 
croparticle of mass m. This amplitude, like the amplitude of a wave 
of any kind, is a function of coordinates. The function (x, y, z) or 
“psi function”, must satisfy the wave equation (see p. 338): 


Ap + p=0. 


Let us substitute in this equation the microparticle wavelength 
A=, expressing the velocity of the particle in terms of its energy. 
If the particle moves in a field with a potential energy U and has a 
total energy @, then 


me =6—U and v= V 2—). 
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Substituting in the wave equation, we obtain 


Ay +r (€-U)y=0. 





This equation is called Schrédinger’s equation and is the fundamental 
law of quantum mechanics. It should be noted that we have merely 
performed a substitution in the wave equation, which by no means 
constitutes a derivation of the fundamental law of quantum mechan- 
ics. The substitution should be viewed rather as a conjecture leading 
to the discovery of this equation. 

Much like Newton’s fundamental law of motion, the Schrédinger 
equation is a law of nature encompassing, as we shall presently see, 
an extensive range of phenomena.* 

This differential equation enables us, in principle, to find at all 
points in the region under consideration the amplitude p(x, y, 2) of 
the wave associated with a microparticle. 

How can the validity of the Schrédinger equation be verified? 
This may be done by means of a basic confirmation of the theory: 
the intensity of the p-wave at every point in space, i.e., the quantity 
*, is the probability density** of an electron at this point in space. 
As for the amplitude of the w-wave, it cannot be determined experi- 
mentally, like the field intensity vectors of an electromagnetic wave. 

The description of a particle by the p-function is not to be regarded 
as an incomplete, imperfect method of describing its motion. It would 
be incorrect to believe that it is solely due to the peculiarities of 
quantum mechanics that a particle has a probability p* (x, y, z) of 
being at a given point in space and that by means of a better theory 
the path of the particle could be determined and its location indicat- 
ed with certainty. An exact description of the motion of a particle, 
i.e., the determination of its path, is not possible because the behav- 
iour of a microparticle is completely different from that of a large 
body. A microparticle is not a particle in the classical sense. 

Let us again refer to the experiment with {wo slits which was used 
to illustrate the dual behaviour of a photon. 

Assume that a beam of electrons impinges on a screen in which two 
slits are cut. Diffraction occurs. Let us direct our attention to one 
point on the screen where, say, interference annuls the wave. If only 
one slit is kept open, electrons reach this point. If both slits are open, 
however, electrons do not reach this point. This phenomenon cannot 
be explained by the collective behaviour of the electrons. 


* We have simplified the picture by not considering the dependence of p 
on time. The exact Schrédinger equation takes this dependence into account. 

** That is, the probability of finding the particle in a small volume divid- 
ed by the magnitude oi this volume. 
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If we insist on the electron’s behaviour as a classical particle, it 
must be accepted that an electron passing through one slit “knows” 
whether the other slit is open or closed, and behaves accordingly. 
It must be concluded that an electron is not a classical particle. 
Every electron has wave properties and the concept of an electron 
path is not fully applicable. Therefore, the question whether the 
electron passed through one slit or the other when both slits were 
open is meaningless. Such a question is meaningful only for “ordi- 
nary” particles, but not for microparticles. 

Does this mean that it is meaningless to speak of the velocity and 
coordinates of a microparticle? This question is answered by the so- 
called indeterminacy (or uncertainty) principle formulated by the 
German physicist Heisenberg. 


181. The Uncertainty Principle 


This principle indicates the limits within which the classical 
description of particles applies to microparticles. 

Applicability of the Path Concept. Let us assume that we wish 
to determine the coordinate of a microparticle at some point x and are 
able to do this with an accuracy of Ax. To “see” a microparticle, one 
must use a “microscope” in which the wavelength of light is suffi- 
ciently short (the shorter the wavelength the greater the resolving 
power). In principle, Ax may be made as small as desired. For this 
purpose, it is merely necessary to reduce the wavelength until Ax 
is of the same order of magnitude as the wavelength: Ax=A. 

However, if the wavelength is short, this means that the correspond- 


ing photon has a large momentum: p =+. This momentum will be trans- 


mitted to the particle being “observed” under the microscope, i.e., 
when the particle is “flicked” its momentum changes by a Ap the or- 
der of magnitude of which is 4. By decreasing A, we decrease Ax, 
the uncertainty of the coordinate, but at the same time we increase 
Ap, the uncertainty of the momentum. Eliminating 4 from the fol- 


lowing relations: Ax A and Ap=+ , we obtain the equation 

Ax xAp ~h, 
the expression for the uncertainty principle. This indicates that spec- 
ifying the path of a microparticle has physical meaning only if it 
is understood that the coordinates and momentum in a given direc- 
tion have uncertainties which satisfy the inequality 


Axx Ap>fh. 
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This remarkable relation establishes the limits of applicability of 
classical physics to a microparticle. 

Substituting the product mv for the momentum p, which is justi- 
fied in the case of velocities which are not very close to the velocity 
of light, we obtain the condition . 


A 


a relation between the uncertainties in a coordinate and the corre- 

sponding particle velocity along the x-axis. The right member of the 

relation will vary by many orders of magnitude, depending on whether 

we are dealing with an electron, atom, molecule or tennis ball. 
For an electron 


h 
— = 7 cm’/sec, 
m 


hence, the uncertainties in a coordinate and the corresponding par- 
ticle velocity are related as follows: 


Ax x Av>7. 


Consider an electron located within an atom, the diameter of which 
is 10-*cm. Can the motion of the electron in the atom be described as 
if the electron were an “ordinary” particle? Using the uncertainty 
principle, we find that Av~10* cm/sec. Thus, we can only speak of 
the velocity of an atomic electron in very general terms. The concepts 
of an electron path in an atom, an electron path of transition from one 
energy state to another (see below), etc., are meaningless. In short, 
an atomic electron is quite different from an “ordinary” particle. 

Now, let us assume that an electron has entered a Wilson cloud 
chamber and that we wish to trace its path with an accuracy of the 
order of several tenths of a millimetre. If the width of the particle 
track is Ax=10-? cm, then according to the uncertainty principle 
Av=7 m/sec. This is the uncertainty of the lateral component of 
the velocity. Even if the electron velocity is only 1 km/sec, an un- 
certainty of the indicated order is insignificant and specifying the elec- 
tron path becomes meaningful. Similarly, we can speak of the path 
of an electron beam in a microscope and the path of electrons in an 
electron-beam tube without coming into conflict with the “classical” 
picture. 

The mass of protons, neutrons, atomic nuclei and atoms is thou- 
sands of times greater than that of an electron. Therefore, the classi- 
cal paths of such particles are of somewhat greater usefulness. Thus, 
for example, in the case of an a-particle, the mass of which is about 
7,000 times greater than the mass of an electron, 


Axx Av > 107%. 
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Is it meaningful to ask at what location in an atom did the path 
of an a-particle penetrating a substance pass through the atom? 
We wish to trace the path with an accuracy of 10~°cm and we have 
at our disposal data on the lateral component of the velocity the 
uncertainty of which is 10° cm/sec =10 km/sec. For a fast a-particle 
(20,000 km/sec), this uncertainty is insignificant. Therefore, it is 
possible to say whether the path of the a-particle penetrating the 
atom passes far from the centre of the atom or not. 

On the other hand, it is meaningless to speak of the path of pro- 
tons or neutrons in nuclei, since the size of a nucleus is ~10~** em. 

For large molecules, e.g., proteins having a molecular weight of 
the order of 10°, the uncertainty principle is of no significance. 
Here, Ax x Av> 107’; hence one can reliably define the path of such 
a molecule in considerable detail. Even the random thermal motion 
of such a molecule, the average velocity of which is of the order of 
I m/sec, may be traced, and the path of its centre of gravity indicated 
with an accuracy of the order of 1A. 

Needless to say, a particle of dust, even if it is visible only under 
a microscope, is too large for the uncertainty principle to be of prac- 
tical significance. 

The Simultaneous Measurement of Two Physical Quantities. 
It should not be thought that the inability to determine the path of a 
particle is due to a measuring deficiency which will eventually be 
overcome by physicists. The lack of meaning in specifying a pair of 
physical quantities with ideal exactness is a peculiarity of micropar- 
ticles. The methods of describing the behaviour of an “ordinary” 
particle are inapplicable to a microparticle. Only for a classical par- 
ticle is it meaningful simultaneously to specify and define its coordi- 
nate and momentum. 

The principle of uncertainty has broader significance than being 
simply a means of judging whether or not the path of a particle can be 
determined. As an integral part of the mathematical apparatus of 
quantum mechanics, the principle of uncertainty enables us to eval- 
uate the possibilities of simultaneous measurement of any physical 
quantities, not only coordinate and momentum. 

First, let us define the uncertainty principle in relation to coordi- 
nate and momentum. It should be recalled that a particle has three 
coordinates and that a momentum vector has three components. 
Instead of the one relation discussed above, three relations should 
be written, namely: 


Ax x Ap,>h; AyxAp,>h, AzxAp,>h. 
The possibility of simultaneously determining (specifying) all the 


coordinates along the different axes, and all the momentum compo- 
nents, is also considered in quantum mechanics. It turns out that it 
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is possible (meaningful) simultaneously to specify the coordinates 
or simultaneously specify all three momentum components. Why 
is this point being emphasised? It would seem that it is always pos- 
sible simultaneously to determine the three components of any 
vector. Careful consideration indicates that this is not so. An example 
of a vector the three components of which cannot be determined 
simultaneously is the angular momentum (i.e., moment of momen- 
tum) of a particle. 

Assume that a particle rotates about an axis with an angular mo- 
mentum L. This motion may be viewed as the resultant of three ro- 
tations about three mutually perpendicular axes with angular mo- 
ments L,, L, and L,. Inthe case of an “ordinary” particle, the three 
components of the angular momentum may be determined separately 
since the path of the particle may be traced. In the case of a micro- 
particle such a determination is not possible, and simultaneous speci- 
fication of all three components of the angular momentum is meaning- 
less. To clarify this point, let us assume the converse for a moment, 
namely, that all three components of the angular momentum are 
known. But then the total angular momentum could be constructed 
from the three components. In that case, the plane in which the par- 
ticle moves is determined. But if this plane is known, then we know 
precisely the coordinate of the particle along the axis of rotation 
and note simultaneously that the velocity of translatory motion along 
the axis of rotation is equal to zero. This contradicts the principle of 
uncertainty relating coordinate and momentum. 

Thus, it’ is characteristic of a microparticle that it is not possible 
simultaneously to determine the three components of its angular mo- 
mentum. What data relative to its rotation may be specified simul- 
taneously? The uncertainty principle gives the following answer: 
any component and the absolute value (vector length) of the angular 
momentum. There is one exception to this rule: complete absence of 
rotation may be established for a microparticle, i.e., the angular 
momentum vector may equal zero; in other words, all three compo- 
nents equal zero simultaneously. 

Energy and Time Interval. On the basis of the uncertainty prin- 
ciple relating thecoordinateand momentum of a particle, one may 
suspect that a more or less analogous relation which involves energy 
exists. Thus, in ordinary particle mechanics, it was necessary to 
know simultaneously the position and velocity of a particle in order 
to calculate the total energy as the sum of potential and kinetic 
energy. For a microparticle this is not possible. However, the total 
energy of a particle may be found as a whole, i.e., without separa- 
tion into parts, and on the basis of what was just said it is natural 
to expect that this may be done with an uncertainty A@. If it is 
assumed that the uncertainty principle maintains its form, then 
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from dimensionality considerations we must conclude that the un- 
certainty relation for energy must have the following form: 


AGAt = h 


where At is the time interval. 

What is the significance of this time interval and how should the 
uncertainty relation for energy be interpreted? The time interval 
At is the time during which a microparticle possesses an energy 
6 +A@. The uncertainty in the energy of a microparticle is deter- 
mined by the time during which it is in the given energy state. The 
uncertainty in energy becomes significant only when the time during 
which it is at the given energy level is measured in minute fractions 
of a second. 

An atomic electron remains for an indefinitely long period of time 
at its lowest, or fundamental, energy level (see below for a detailed 
discussion). Therefore, the energy of the fundamental state is fixed 
quite rigidly. An electron remains for a very short period of time 
at a higher level. Its energy in such a state is G4A@. Accordingly, 
when an atom passes from a higher energy level to a lower one, the 
radiation frequency cannot be exactly defined, i.e., it lies in the nar- 


row band vA | This may be observed experimentally: spectral 


_lines are of finite width, which may be used to determine the so- 
called lifetime of a microsystem in an excited state. Experiments 
show, for example, that the width of spectral lines in the X-ray 
region is of the order of 10 ev. Thus, in such a case, the lifetime in 
an excited state is of the order of “4 ~10-" sec. 


AS 


182. The Potential Square Well 


On page 55, we discussed potential curves, which clearly illus- 
trate the conditions of particle motion. The simplest curve of this 
type is a right-angled curve, a so-called square well. In such a well, 
the potential energy has a constant value over a segment a (for 
simplicity, we restrict ourselves to the linear case). At the edges of 
the well, the potential energy changes abruptly. If the walls are 
very high, the potential energy inside the well may be considered 
equal to zero (since the choice of origin is immaterial), and at the 
edges of the well equal to infinity (see Fig. 210). 

Assume that an electron (or some other particle) is located in the 
well. Let us try to determine the nature of its motion for the simple 
one-dimensional case, i.e., let us assume that the electron is moving 
along the x-axis. If Newton’s laws of mechanics were applicable to 
an electron, such an electron would move continuously—first 
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toward one side of the well, where it would be elastically reflected 
from the wall, then toward the other side, etc. There is no other 
possibility from the viewpoint of Newtonian mechanics, since for 


U=0 the kinetic energy sin is constant. Thus, for motion in a square 


well, the following conclusion may be drawn from the mechanics 
of “ordinary” particles: A particle 
may move in the well with any 


‘ : mo? ‘ A 
kinetic energy —5- oF it may remain 


motionless. For any given energy, 
the motion is uniform—first toward 
one side, then toward the other, 
i.e., the velocity reverses direction 
abruptly at the end of the allowed 
interval. 

Now, let us consider the electron 
motion in such a well from the 
viewpoint of quantum mechanics. 

The behaviour of the electron is 

‘Fig. 210 characterised by the -function. 

The square of this function indi- 

cates the probability of finding the electron at some point in the 

given interval. Since U=0 inside the well, the Schrédinger equa- 
tion becomes simplified and may be written in the form 





ey A p=0, where Ave : 
This equation is satisfied by the sine and cosine of the argument 
aa If the well is bounded by the coordinates x0 and x=a, then 
for these values p=0, since the electron does not penetrate the walls. 


Therefore, cos on is not suitable as a solution to the equation 


(cos 2m val at x=0). Hence, 


p= Asin a x. 
But the wavelength 4 cannot be arbitrary, since =O at x=a. The 
following equation must, therefore, be satisfied: 
= a= (n + 1) ALA 
or 


2a 
A= oT? where n=0, 1, 2,.... 
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Thus, the wavelength may assume the values 2a, a, aa $ ... It 
is seen that the p-function represents the amplitude of a standing 
wave (see p. 131) and that formally this problem has much in common 
with that of a vibrating rod or string. But if the wavelength 4 has 
a discrete set of values, then the energy @ of a microparticle cannot 
be arbitrary, i.e., 

ean. 


sma” 


the integer n is called the quantum number. 
Thus, the Schrédinger equation leads to the quantisation of energy. 
A microparticle in a square well has a discrete set of energy levels. 


The lowest energy level occurs for n=0. This energy is equal to wo 


and is the zero-point energy of a particle located in a square well. 

The existence of a zero-point energy is an interesting peculiarity 
of microparticles. In the case of “ordinary” particles, the lowest 
energy has a value of zero. But under no circumstances can micro- 
particles cease to move. At absolute-zero temperature, a micro- 
particle has a specific zero-point energy, the values of which differ 
considerably depending on the nature of the force field in which 
the particle is located. 


Example. Assume that a=1A (a characteristic value for an atomic region). 
Then, the zero-point energy of an electron in the square well is 


nn (6.6 x 10727)? = 
Eo Smai = TRO TXI0 =F 10 Hs = 0-6 X 107" erg=37 ev. 


If a=1 cm (a free electron in a piece of metal), 8,=0.6X 107 *erg=37 X 107" ev. 


The velocity of an electron at a given energy level may be calculat- 
ed by means of the wavelength: v=. But in such a case, the elec- 


tron motion cannot be described by the equations of classical 
mechanics. It is not possible to indicate where the electron is located 
at one or another instant of time. However, the quantity »’, i.e., 
the probability density of the electron at one or another point in 
space, may be determined from the equation 


tp = A? sin? x= A? sin? A(n+l)x. 


It should be noted that to each energy level there corresponds 
a proper (of same number n) wave function (also called characteris- 
tic function or eigenfunction). 

Fig. 211 shows the w-function and w’-function for the first four 
energy levels of an electron located in a square well. Quantum 
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mechanics leads to the conclusion that an electron does not appear 
with equal frequency at different points in the region. If the electron 
energy is a minimum, i.e., the electron is located at the lowest 
level (n=0), the particle is most often encountered at the centre 
of the “well”; if the electron is located in the state corresponding 
to n=1, it is never encountered at the centre of the allowed seg- 
ment, etc. The p, curves indicate the frequency with which the elec- 
tron appears at various points in the region. 

Now, let us summarise. On the basis of quantum mechanics, the 
following conclusions may be drawn regarding the motion of a mi- 
croparticle in a square well. Only motion corresponding to a dis- 


v(x) hea 





Fig. 211 


crete set of energy values, @,, 61, G2 ..., is possible. The particle can- 
not be stationary since even the lowest energy level corresponds to 
motion with a certain velocity. Data on the nature of particle mo- 
tion for a given energy are provided by the p’-function. Knowing 
p(x), one may determine at what points in the region the micro- 
particle appears often, and at what points rarely. 

It remains to be determined under what conditions an “ordinary”, 
i.e., classical, description of particle behaviour is valid. 

Imagine that an oxygen molecule is enclosed in a box the dimen- 
sions of which are scores of times greater than the dimensions of 
the molecule. Assume that the molecule has the average energy, at 
room temperature, of a molecule of oxygen gas, i.e., 107** erg. 
Using the values a=100 A, m=5.4x10~** gm and 6 =107"* erg, 
one may determine the quantum level of the microparticle. The 
result is n=1,000. Two conclusions may be drawn. First, the wn 
curve has such a large number of alternating maxima and minima 
that the probability density of the particle is approximately the 
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same for all points in the box. Secondly, neighbouring energy levels 
are very close. 

Both characteristics which follow from the fundamental equation 
of quantum mechanics have disappeared: the probability distribu- 
tion of the particle is practically indistinguishable from a smooth 
curve and the energy levels are so close that energy discreteness is 
imperceptible. In such a case, quantum mechanics yields approxi- 
mately the same results as particle mechanics. This is true whenever 
the particle energy corresponds to a large quantum number. We 
have illustrated an important principle of quantum mechanics: 
when the quantum number is large, the results of quantum mechan- 
ics coincide with those of the mechanics of “ordinary” particles. 
This means that when n is large the particle-path concept and other 
ee of ordinary particles are applicable to microparticles 
as well. 


183. Significance of the Solution of the Schrodinger 
Equation 


We have devoted a relatively large amount of space to the motion 
of a particle in a square well. On the basis of this simple example, 
we were able to illustrate the basic features of the quantum-mechan- 
ical method of considering problems. If an electron or other particle 
is able to execute motion in a limited region, the characteristic 
features of the solution of the Schrédinger equation are preserved, 
no matter what the form of-the potential curve in this region. In 
all cases, the potential well may be intersected by a number of hori- 
zontal lines—possible energy levels. In principle, the Schrodinger 
equation enables us to calculate these energy values if.the form of 
the potential well is given. The lowest level gives the zero-point 
energy of a particle in a given potential well. 

For each energy level of number n, quantum mechanics establishes 
a set of wave functions »,(x, y, z). The quantity wpn(x, y, 2) gives 
the probability of finding a particle at a given point in the region if 
the energy of the particle is @,. Since the particle manages to be at 
all points in the region more than once during the time of meas- 
urement, wp” (x, y, z) may be viewed as the density of the “particle 
cloud”. The electron cloud surrounding an atomic nucleus resembles 
a photograph of the atom taken with long time-exposure. The 
ap-function gives the amplitude of the wave associated with the par- 
ticle. In the case of an electron in a square well, the w-function con- 
sists of standing waves and to every level there corresponds a 
characteristic wavelength A. 

This is not the situation in the general case. The standing “waves” 
corresponding to a given state (given n) will appear very strange: 
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their wavelengths as will differ at different points in 
the region, depending on the nature of the potential “curve” U(x, 
y, 2). For more or less complex cases, there is only slight similarity 
between the w-function and the amplitude of a standing wave (in 
the usual sense of the term). 

Theoretical and experimental evidence indicate that in a number 
of cases several w,-functions may correspond to a single @,, energy 
value. This occurs if at one energy a particle may have states which 
differ with respect to another physical quantity, e.g., angular mo- 
mentum. The forms of the ’ clouds of such a state—called a degen- 
erate state—may differ radically from one another. 

The problem of particle motion in a given type of potential well 
is solved when the energy levels are found and the characteristic 
ap-functions are calculated for all levels. If the solution of the Schr6é- 
dinger equation is known, the result of one or another measurement 
performed on the given particle may be predicted. 


184. Tunnelling Through a Barrier 


We shall now discuss a peculiar effect which may occur in the case 
of a microparticle, but not in the case of an ordinary particle. This 
is the tunnel effect, i.e., the “leakage” of a particle through a poten- 
tial barrier. 

Imagine that inside the region in which a particle moves, there is 
a potential barrier of height U and width d (Fig. 212). If the energy 
u of the particle is @€<U an ordinary. 

particle could be either in front of 
the barrier or beyond the barrier. The 
particle cannot pass through the bar- 
rier, since this would require that the 
particle have a negative kinetic ener- 
gy and an imaginary velocity, which 
is absurd. Matters stand differently 
with respect to microparticles. The 
x uncertainty principle does not per- 
mit us simultaneously to ascribe to 
a microparticle exact values of veloci- 
Fig. 212 ty and coordinate, and hence of kinet- 
ic and potential energy. That is why 

a particle having a total energy @ may pass through the barrier. 

The conditions for such passage may be determined as follows. 
Coordinate and momentum uncertainties are connected by the rela- 
tion AxAp=h. Momentum uncertainty is uniquely related to kinetic 
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energy uncertainty since K=2., If AK is a quantity of the order 
of U—@, where @ is the particle energy and U is the height of the 
barrier a particle to the left of the barrier (see figure) has a coordi- 
nate uncertainty 
Nise 
Ap Vim (Ue) 
If the barrier width d is less than Ax, the particle may be located 
on the other side of the barrier. The particle tunnels its way, so to 
speak, through the barrier at the total energy level @. 
Thus, the condition for tunnelling is 


dV 2m U—E)<h, ie. £Y2ImU—S)<1. 


It may be easily shown by numerical examples that the phenomenon 
is of significance only for microparticles. 

i For U—g=10 ev~107" erg, m~10~? gm (electron mass) and d~107*%cm, 
7 V 2m (U—@)=0.2<1, i.e., tunnelling is possible. 

‘ For a I-gm spherule lying next to a match box (U—g=3,000 ergs and d=2cm) 
ir V 2m (U— €)=2.5X10">1. It is evident that the spherule cannot “tunnel” 
through the match box. 


The probability of leakage through a barrier may be calculated. 
It turns out that this probability is proportional to 


ant rrr 
eR Vem (U-@) 4. 


The tunnel effect could be predicted on the basis of the Schré- 
dinger equation. The solution of this equation shows that even at 
points where U>@ the -function has values differing from zero. 
Thus, there is a certain probability—inversely proportional to the 
magnitude of U—€—that an electron is located in a region where 
in the language of “ordinary” particles it possesses negative kinetic 
energy. 


CHAPTER XXVIII 
ATOMIC STRUCTURE 


185. Energy Levels of a Hydrogen Atom 


A hydrogen atom has one electron which “rotates” in a nuclear 
field. One would think that the problem is simple. But even for this 
most simple atom, the solution of the Schrédinger equation is very 
cumbersome and, therefore, will not be reproduced here. However, 
we shall give the results of the calculations and discuss them in 
considerable detail. 

An electric force of Coulomb attraction acts between the electron 
and the nucleus. The potential energy of an electron in a nuclear 


fieldis Us —£ , where e is the charge of an electron (the same as the 


charge of a proton) andr=/x*+-y?+2* is the distance between the 
electron and the nucleus. 

The Schrédinger equation 
has the form 


Ap +E" (+5) p=0. 


Such an atom constitutes 
a peculiar kind of poten- 
tial well and is illustrated 
in Fig. 213. This is a well 
without a bottom and with 
divergent sides. The sides 
of the well are hyperbolas 
and the axis r=0 is one of 
the asymptotes. The elec- 
Fig. 213 tron inside the atom has a 
negative potential energy * 
since the minimum value of potential energy tends to infinity 
when r—0 and the maximum value is equal to zero. 
Fig. 214 shows the energy levels obtained from the solution of 
the Schrédinger equation. An important feature of the solution is 
the drawing together of the levels as the quantum number n increases. 

















* It may be asked: why has the origin of potential energy been chosen in 
such a manner that the electron energy is negative? The advantage of such a 
choice is not difficult to see. For different atoms, the potential energy has the 
same value only when r+ o. It is natural to set this common value equal to zero. 
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Fig. 214 


Transitions between levels will be discussed below. The scales of 
values, which are proportional to energy, are given in the units 
adopted in spectroscopy: volts and reciprocal centimetres. The 
energy level formula may be written in the form 
2?me* 
aaa Be 
For historical reasons, it is customary to write this formula in the 


form 
cRh 
On a Sai nm? 


20?met “1; 
where R= ae = 109,740 cm~? is the Rydberg constant. 
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Thus, it transpires that the total energy as well as the potential 
energy of the atomic electron is negative. The atomic electron may 
be located at any one of n levels. The greater the value of n, the 
higher the energy level, and the greater the energy possessed by the 
electron. The electron of a free hydrogen atom on which no force acts 
is at the lowest energy level: €,=—cRA. 

If an energy greater in magnitude than cRh is transmitted to the 
atom the electron leaves the bounds of the potential well, i.e., the 
atom becomes ionised. The energy cRA is called the ionisation energy. 

It is customary to characterise the work of tearing away an elec- 
tron from an atom by the ionisation potential, For a hydrogen atom, 


V7. ' ¢Rh 3¢10"5¢ 109, 740 X6.6X% 1072? - 
ion ~~ ~@ =————"Fexlo 
A wot statvolt = 13.5 volts. 


The reason for the above designation is the following. Let us 
assume that the electron is ‘torn away from the hydrogen atom by 
the action of a beam of electrons. To ionise a hydrogen atom, one must 
accelerate electrons, which act as projectiles, to an energy of at 
least eV =cRA. Therefore, V is the potential difference through which 
an electron must be accelerated in_order to produce ionisation of 
collision with a hydrogen atom. 

If the energy imparted to a hydrogen atom is less than cRA, a 
transition of the atom occurs to one of the n levels*. Such an atom 
is said to be in an excited state. 

An atom stays in an excited state for a small fraction of a second 
and then passes to a lower level with the emission of a photon in 
accordance with the equation _ 


WV inn =€¢,—6,=CRh (ar). 


If hydrogen atoms are excited by different kinds of collisions, 
they are raised to different energy levels and return to the ground 
state by “skipping” over levels (see Fig. 214). Therefore, a large con- 
centration of hydrogen atoms will radiate photons of every possible 

Vnn wrequency. A characteristic line spectrum of emission arises. 

By calculating for a given n the v,, irequencies corresponding to 
the numbers m=n-+1, n+2, ..., we obtain a series of frequencies of 
lines in the hydrogen syectrum. The existence of such series was 
known long before quantum mechanics was developed. The series 
corresponding to n=2 is known as the Balmer series. By substituf- 
ing m=3, 4, 5, 6, 7 and 8, respectively, in the formula, let us cal- 

* In the case of a hy drogen atom, which has one electron, the phrases * ‘the 


afom is at an energy level n” oe “the electron is at.an energy level 1 n” have the 
same meaning. | | - 
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Moreover, the Schrédinger equation shows that relative to the 
selected direction z the angular moment L must be oriented in such 


a manner that 
h 


on’ 


L,=ms5- 


where m is an integer that may assume any value from —/ to +, 
including zero. 

It should be recalled that according to the uncertainty principle 
L and L, give us all that we can possibly know about the angular 
momentum; in other words, it is meaningful to specify simultane- 
ously only these two quantities. 

Thus, the state of an electron in an atom is characterised by three 
quantum numbers: n, J and m. The number n is called the principal 
quantum number, J the reduced azimuthal quantum number, and m 
the magnetic quantum number. 

The states with /—0, 1, 2, 3, ... are designated by the letters s, p, 
d, f,..., respectively. The principal quantum number precedes one 
of the above letters. For example, the 3p state is the state with n=3 
and /=1. 

Let us list all of the possible states for n=1, 2 and 3: 


n | i | Designation of the state | m 


1 
2 


3 
3p —1, 0, 1 
3d —2, —1, 0, 1, 2 


Nr OF OO 
ive) 
a” 


The energy transitions of a hydrogen atom are determined exclu- 
sively by the values of the principal quantum number n. In order 
for the / and m numbers to play a part, the degeneracy must be 
“removed”, i.e., the energy of states with a different angular momen- 
tum must be changed. In the case of hydrogen atoms, this may be 
done by placing the atoms in a magnetic field. In other cases, degener- 
acy is removed by electron interaction (see below). 


187. The Electron Cloud of s and p States 


The state characterised by the three numbers n, / and m is de- 
scribed by the wave function wp, , ,- The characteristic form of the 
electron cloud corresponding to this state is determined by the func- 
tion tp; ),,- Let us consider the form of the »p*-functions of a hydrogen 
atom which characterise its various excitation states. 
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Consider the s states. Since /—0, m is also equal to zero. Hence, 
for every n there is only onep-function. The equation /=0 means 
that the electron has no angular momentum. This requires, of course, 
that there be no favoured directions of motion, i.e., the electron 
cloud must have spherical symmetry. Such is the result obtained 
from the Schrédinger equation: the functions ¥,,, ,,, W,,, etc., 
have spherical symmetry. 

Fig. 216 shows curves of radial density distribution of the electron 
cloud or, what amounts to the same, the probability density distri- 
bution of the electron. The quantity 4xr*p’, which gives the radial 





Fig. 216 


density, is plotted along the ordinate. It is evident that 4sr?\p"dr 
represents the number of electrons* contained in a spherical shell 
the inner and outer radii of which are r and r+dr, respectively. The 
radial density curves show that in the Is state there is one electron 
density maximum, which for a hydrogen atom is located at a distance 
of 0.53 A from the nucleus. In the 2s state, there are two density 
maxima; the electron will be within the second maximum moste 
of the time. Finally, in the 3s state, there are three density maxima; 
here, the electron will be within the third maximum most of the 
time. 


* A fractional number of electrons should not disturb us, for this is only a 
manner of speech. Strictly speaking, 4ur*p*dr is the probability of an electron 
being inside a spherical shell of dr thickness. 
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As the principal quantum number n increases, the electron cloud 
becomes dissipated. 

The p-state functions look entirely different. There are three val- 
ues of m, namely, 0, —1 and +1, corresponding to /=1. The elec- 
tron-cloud configurations are illustrated in Fig. 217. For m=O, 
the major axis of the “figure eight” is oriented along the selected di- 
rection; for m=+1, the major axis is perpendicular to the selected 





Fig. 217 


direction. It is evident that the m=+1 states may be meaningfully 
distinguished only when both are present. The figure gives some 
indication of the symmetry of the electron cloud. It is the same for 
all states. A change in the principal quantum number merely results 
in a change in the nature of the radial drop in density: the greater 
the value of n, the more extended the curve. 

We shall not discuss states with large values of J. Their electron 
clouds are more complex. 


188. Pauli’s Exclusion Principle 


Atoms are arranged in the Mendeleyev periodic table in accordance 
with the number of electrons contained in them. Thus, helium has 
two electrons, lithium three and beryllium four. On the basis of the 
Schrédinger equation, what can be said about the structure of atoms? 

At first glance the problem may appear hopeless. Even in the case 
of helium, strict adherence to procedure would involve solving the 
Schrédinger equation for a wave function with six variables, 
W(X1, Y4» 24» Xa» Yo» 2), the square of which gives us the probability of 
finding the first electron at point x,, y,, 2, when the second electron 
is at point x,, y,, 2g. The potential energy to be substituted in the 
equation is 
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where r, and r, are the distances of the electrons from the nucleus 
(the nuclear charge of helium is 2e) and rj, is the distance between 
electrons. An exact solution of such a problem is quite impossible. 

It would be extremely desirable to deal separately with each atomic 
electron and describe each such electron by its wave function 
p(x, y, 2). But how can this be done? Evidently, it is necessary to 
consider the motion of one electron in the field of the nucleus and the 
remaining electrons. It may be assumed that this effective field has 
spherical symmetry. Therefore, the description of the properties of 
such an electron will not differ from that of the electron of a hydrogen 
atom. 

To be sure, the problem is still quite difficult: for different elec- 
trons these effective fields differ and, moreover, all must be deter- 
mined simultaneously since each depends on the states of the remaining 
electrons. Such an ‘effective field is said to be self-consistent. This 
approach to the problem of a multi-electron atom enables us to apply, 
to a large extent, the description of the properties of a hydrogen 
atom’s electron to the behaviour of an electron of a complex atom. 

The state of each electron will be characterised by the same quan- 
tum numbers as in the case of hydrogen. However, in the case of a 
multi-electron atom, degeneracy is removed by electron interaction, 
and levels with different / and m values will have different energies. 

The Schrodinger equation enables us to determine the energy levels 
that are possible, but does not indicate the energy of the atomic 
electrons. One might think that all the electrons of an atom occupy 
the lowest energy level. In any case, such would be the behaviour 
of “ordinary” particles. But experiments completely refute such a 
supposition. The “arrangement” of electrons in accordance with energy 
levels is governed by the Pauli exclusion principle. The first conjec- 
ture that such a principle exists was based on a study of the Men- 
deleyev periodic table. 

As was indicated above, it follows from the Schrédinger equation 
that (2/---1) states exist for agiven and J. It was also indicated that 
this in turn yields n® different w-functions for a single value of n. 
The first values of n’ are 1, 4 and 9. Let us examine the Mendeleyev 
periodic table. Helium, neon and argon, which complete the first 
three periods of the table, contain 2, 8 and 18, i.e., 2n’, electrons, 
respectively. This is by no means accidental. It is an expression of 
a profound law according to which only two electrons can have the 
same p-function. In other words, an energy level cannot be occupied 
by more than two electrons. This general law of nature, to which we 
shall return in Sec. 190, is called the Pauli exclusion principle. 

By means of this principle, we can “arrange” the electrons of a. 
complex atom in accordance with quantum numbers and, therefore, 
in accordance with energy levels and values of angular momentum. 
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A helium atom has two electrons, which may occupy the single Is 
level. The third electron of lithium must be located at the next 
level, i.e., the 2s level. A beryllium atom has four electrons, which 
occupy the Is and 2s levels. The fifth electron of boron is at the 2p 
level. At this level, there are six places for electrons, which are all 
filled when neon is reached. But let us postpone consideration of the 
relation of the Mendeleyev periodic law to the electron structure of 
an atom until Sec. 192. 

Do two electrons occupying a level which is characterised by the 
same three quantum numbers differ in any way? It turns out that 
two such electrons differ with respect to the orientation of their inter- 
nal angular momentum (“spin” orientation). 


189. Deflection of an Atomic Beam in a Magnetic Field 


In the preceding articles, the angular momentum of an electron 
due to its motion about a nucleus was discussed in considerable 
detail. The presence of such angular momentum may be demonstrat- 
ed since an atom acquires a magnetic moment as a result of the 
motion of its electron. 

Let us employ classical concepts and assume that the electron 
revolves in a circle of radius r. Since current is equal to the charge 
transferred in a unit time, the equivalent current of such a revolving 
electron is J =ne, where n is the number of revolutions per second. 


On the other hand, n=5—, where v is the velocity. Thus, 


ve 
Ls 


2mr 


and the magnetic moment of the revolving electron (see p. 260) is 


Ce Sai xara evr. 
c ce 2nr 2¢ 


L, the angular momentum of an electron, is equal to mur. Hence, 
é 
M = Sme i. 


This relationship between the angular momentum and the magnetic 
momentum of an electron moving about a nucleus, obtained by 
means of the above simple calculations, has been experimentally 
confirmed for all atomic electrons. 

Thus, atoms for which L=40 possess magnetic moments, and it may 
be demonstrated in a suitable experiment that such atoms behave 
like smal! magnets. Fig. 218 illustrates such an experiment in which a 
parallel beam of atoms passes through a nonuniform magnetic field. 
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Air is carefully pumped out of a long tube. At the left end, an 
atomic gas is created. Atoms in the left compartment can get out 
through small apertures in thescreens. Since there are two apertures, 
only atoms moving along the axisof the apparatus will remain in the 
beam. This parallel beam of atoms passes through a nonuniform 





Fig. 218 


magnetic field. As was indicated on p. 284, in such a field the force 
acting on a body which has a magnetic moment M is 


dH 
f=M,q,> 


where nH is the field gradient in the direction perpendicular to the 


atomic beam and M, is the projection of the magnetic moment on 
the direction line of the gradient. If the magnetic moment is perpen- 
dicular to the field, no force acts on the body, but if the moment is 
directed along the field, the body is attracted to either the north or 
south pole, depending on the direction in which M is pointed. If 
the atomic beam contains atoms having different magnetic moments, 
or differently oriented magnetic moments, the beam of atoms will 
spread out: atoms travel in different directions and different forces 
act on them. The beam is allowed to impinge on a plate until enough 
atoms hit the plate to be perceptible. 

The above experiment has been of great importance in the develop- 
ment of basic atomic theory. It is still of importance as a method of 
determining the magnetic moments of atomic nuclei. 


190. Electron Spin 


By means of experiments with atomic beams, one can measure M 
and, therefore, L. One of the important deductions of quantum me- 
chanics pertains to the quantisation of angular moment L: the total 
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momentum L can assume only a discrete set of values, namely, 


La V e+e: 


where / is the azimuthal quantum number. Therefore, the magnetic 
moments of atoms also can assume only a discrete set of values: 


Me Gh =n Tae), 


4nme 
The coefficient in this formula, 








p= pe = 0.927 10-* erg/gauss, 


is called the Bohr magneton*. 

In experiments with atomic beams, the external magnetic field is 
directed perpendicular to the atomic beam. The projections of Z on 
this perpendicular line can also have only a discrete set of values: 

h 
L, =m on? 
hence, 
M,=mp, 


where m is the magnetic quantum number. It is seen that the values 
of M, must be equal to a whole number of Bohr magnetons. 

M, may be determined directly from experiments with atomic 
beams. What should be the result of experiments with beams of atoms? 
We may expect the following picture. Hydrogen, helium, lithium 
and beryllium have only s electrons. Since L=0 for these atoms, 
such a beam does not split up in a magnetic field. When p electrons 
are present, the beam may be expected to split up into three distinct 
components: an undeviated central component for m=—0, and two 
components arranged symmetrically to the right and left form=+1. 
For d electrons, we should obtain five beam components corre- 
sponding to the five possible values of the quantum number m, etc. 

These predictions are realised to the following extent: in certain 
cases a beam of atoms does not split up, while in others it splits up 
into distinct components. Thus, it is evident that some atoms have no 
magnetic moment; if an atom has a magnetic moment, it is quantised. 

As regards the results obtained for specific atoms, they point to 
a completely new fact, namely, that an electron has an internal 
magnetic moment. 

Such is the conclusion to be drawn from an experiment with a beam 
of hydrogen atoms: a beam of hydrogen atoms splits up into two 


* Niels Bohr (1885-1962) noted Danish scientist who developed the 
first theory of atomic structure. 
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symmetrical components, which correspond to the deflections of 
atoms with magnetic moments +p. There is no undeviated central 
component! This fact may be explained by the following hypothesis, 
which is supported by other data: an electron has a magnetic moment 
and there are only two possible orientations of this moment in space, 
namely, the projections +p on the direction line of the external field. 

The magnetic moment of an electron due to its motion about a 
nucleus is uniquely related, as we have just seen, to the angular 
momentum of the electron motion about the nucleus. It also tran- 
spires that the internal magnetic moment of an electron is related 
to its internal angular momentum, or spin. 

As far back as 1925, before the above experiments which show 
that an electron has an internal magnetic moment were performed, 
Goudsmit and Uhlenbeck suggested that an electron has a spin. 
These physicists showed that such a hypothesis—the existence of 
an electron spin, i.e., an internal angular momentum—removes 
insurmountable difficulties in deciphering spectra. At first, it was 
proposed that spin is a consequence of the rotation of an electron 
about its own axis (whence the origin of the term “spin”). But this 
interpretation is incorrect. Electron spin is a primary characteristic 
and is not reducible to something simpler. 

What is the relationship between the internal angular momentum 
(spin) and the internal magnetic moment of an electron? The exper- 
iment with a beam of hydrogen atoms leads to the conclusion that 
M,, the projection of the internal magnetic moment of an electron, 
may assume only two values, namely, + p. It may be assumed that 
L,, the projection of the spin, may also assume only two values. 

If the formula 


L=Vit+e 


is applied to the internal angular momentum of an electron, one 
finds that the number / has a single value. Thus, as quantum mechan- 
ics indicates, the values /=0, 1, 2, ... correspond to 1, 3, 5, ..., 
and in general (2/+1) states, respectively. To obtain two spin 
states, which the experimental results show to be the case (2/+-1 =2), 
one must assume that l=+ : 

The absolute magnitude of an electron’s internal angular momen- 
tum (spin) has the single possible value L= Vo. As for the 
projection of spin, assuming as before that the differences in the pos- 


sible values of L, must be multiples of ua , we see that it can assume 


only two values, namely, +5 i and — 4-3 . Thus, (L,),,= 6X a , 
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where § is anew quantum number (spin number) that can assume only 
two values, namely, +>. 


It was stated earlier that the hydrogen experiment led to deflec- 
tions corresponding to a magnetic moment of one magneton, and 


that M,—w. Since the quantum number 6 is equal to-. , therelation 


turns out to be incorrect for the internal motion of an electron. 
Agreement with experimental results is obtained if 
eé 
M po Lise 

Thus, eM 
electron motion about anucleus, is one half of the analogous ratio 
for internal motion of an electron. 

The existence of electron spin enables us to formulate the Pauli 
exclusion principle more clearly. There can be no more than two elec- 
trons in a state having quantum numbers n, / and m. Such electrons 
differ only with respect to spin projection. Experiments show that 
the spin projections of two such electrons cannot be the same. The 
Pauli exclusion principle may now be formulated as follows: There 
can be only one electron in a state characterised by four quantum 
numbers n, 1, m and 6. In other words, if there are two electrons in 
an n, I, m state, their spin directions are opposite to each other. 


the ratio of magnetic marnent to angular momentum for 


191. Magnetic Moments of Atoms 


The electron spin hypothesis makes it possible to interpret the 
results of experiments with atomic beams. The measurement of 
magnetic moment is one of the most important methods of determin- 
ing the electron state of atoms. Let us consider the first few elements 
of the Mendeleyev periodic table. 

We have already studied the hydrogen atom. What happens to 
a beam of helium atoms? Such a beam does not split up. This is as 
it should be. This atom has two electrons in the 2s state. The Pauli 
exclusion principle requires that their spins, and hence their magnet- 
ic moments, be oppositely directed; the total magnetic moment is 
equal to zero. 

In a lithium atom, the magnetic moment must be determined by 
the third electron since the actions of the spins of the first two elec- 
trons in the ls state annul each other. The third lithium electron 
is in the 2s state. Therefore, like in the case of hydrogen, the magnet- 
ic moment can be determined only by the spin. The splitting up of 
a lithium beam into two components shows that this analysis is cor- 
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rect. Like in the case of hydrogen, one component corresponds to 
a spin projection +, and the other to a spin projection — 
A beryllium atom has four electrons—two in the Is state and two 
in the 2s state. Since the spins of the electrons in each pair are oppo- 
sitely directed, they annul each 
other and yield a total moment of Boron 
zero. Therefore, a beam of beryl- 
lium atoms does not split up into Por eee ee ee 
components. m=] 4 
However, a beam of boron atoms —— 9 
splits up into four components. 
How can this be explained? It is 
evident that the magnetic moment ee 
is produced only by the fifth elec- m=0 
tron, which is in the 2p state. This 
state may be realised with three 
values of the magnetic quantum 
number, namely, +1, 0 and —1. 
Thus, the orbital magnetic moment 
may have three values, including 
zero. The values of the spin mag- 
netic moment must be added to those Fig. 219 
of the orbital magnetic moment. 
How is this done? Fig. 219 shows all the possible mutual orienta- 


tions of the moments. It is seen that four combinations of angular 
1 1 


momentum are possible: = ok eee and —3 (in us units, usually 
not indicated). 

The splitting up of a beam of carbon atoms is even more complex: 
seven lines, including one which is not deflected, with the following 
values of angular momentum: 3, 2, 1, 0, —1, —2 and —3. 


i) 
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Nin 


Nim 
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m=-1 
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192. The Mendeleyev Periodic Law 


The regularities of the Mendeleyev periodic system become un- 
derstandable when viewed in the light of available data on the elec- 
tron structure of atoms. 

The basic rules of distribution of electrons in an atom according 
to quantum numbers are derived from the Pauli exclusion principle. 
But if we were governed by this information alone, only the electrons 
of the first eighteen atoms (through argon) would be assigned correct 
quantum numbers. This may be explained as follows. Electrons fill 
the energy states of an atom in consecutive order. In a hydrogen 
atom, the energy levels are degenerate, the energy being determined 


17-3580 
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by only one quantum number. In multi-electron atoms, degeneracy 
is removed as a result of interaction and the energy of an electron 
in an atom depends on all the quantum numbers. It goes without 
saying that in ahydrogen atom an electron at a level for which n=3 
has less energy than an electron at a level for which n=4. This is not 
necessarily so in multi-electron atoms when the azimuthal quantum 
number is large. Thus, in a number of cases, the “natural” consecu- 
tive order of quantum numbers does not correspond to theorder in 
which the energy states of an atom are filled. 

It is customary to group the electrons of an atom into shells, 
whereby electrons having the same principal quantum number are 
said to belong to the same shell. The shells are usually designated 
by the following letters: 

K, L, M, N,. 


which correspond to n=], 2, 3, 4, ..., respectively. 

The electron cloud of an atom is described in the main by the 
distribution of electrons according to quantum numbers or shells. 
This distribution is represented by formulas which indicate by a 
superscript the number of electrons having the same n and J. For 
example: 

silicon—Is* 2s? 2p°* 3s? 3p); 
calcium—1s? 2s? Ops 3s” 3p* 4s”. 


To determine to what extent a shell is complete, one should re- 
member that the maximum number of electrons in the subshells 
s, p, d, f,... is 2, 6, 10, 14,..., respectively. These values are obtained 
from the formula 2 (21+). 

Returning to the Mendeleyev table, let us see where the order of 
distribution of electrons according to quantum numbers is violated. 
The first such violation occurs for potassium. The last electron is in the 
As level rather than in the 3d level. Calcium, the next element in the 
table, receives another 4s electron. Then, beginning with scandium, 
the 21st element, the 3d level is built up. But when we reach chro- 
mium, the 24th element in the table, a new anomaly arises. The 
order of distribution of quantum levels according to energies has 
changed. It becomes unfavourable from the energy viewpoint to have 
two electrons in the 4s level. The configuration of chromium is there- 
fore 3s* 3p° 3d’ 4s. 

We shall not discuss the remaining anomalies. The electron 
configurations for all elements may be found in any physics or chem- 
istry handbook. The main point is the following: the distributions 
of electrons according to quantum numbers, which are explained by 
purely physical methods of investigation (spectral analysis and the 
measurement of magnetic moments), aid us in understanding the 
chemical properties of the various elements. 
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193. Ionisation Potentials 


One of the methods used to study the electron configuration of 
an atom is to measure its ionisation energy, i.e., the energy that must 
be expended in removing an electron from the atom. Since the energy 
of an electron in an atom is negative and is reckoned from zero (the 
energy of an electror’ removed from an atom), the ionisation energy 
is simply equal to the energy level occupied by the electron in the 
atom. It is customary to refer this energy to the electron charge and 
express it in volts. For example, we say that the ionisation potential 
of a hydrogen atom is equal to 13.53 volts. This means that to free 
an electron one must perform work equal to that of moving an elec- 
tron through a potential difference of 13.53 volts. Fig. 214 shows 
the significance of this value. 

In the case of a multi-electron atom, one may find a series of ion- 
isation potentials which characterise the levels of the first, second, 
third, etc., electrons, calculating from the position of the least bound 
electron. In this sense, one speaks of the first, second, etc., ionisa- 
tion potential of a given atom. 

There exist many methods of measuring ionisation potentials. 
For this purpose, gases or vapours are placed in an electric field. 
A stream of electrons emitted by a heated filament ionises the gas. As 
long as the energy of a primary electron is insufficient to dislodge an 
atomic electron, the electric current passing through the gas does 
not change. When the energy of the primary electrons becomes 
sufficiently great to dislodge electrons from the atomic gas, a con- 
siderable number of positively charged ions will be present in the 
region and a sharp increase in the electric current occurs. By grad- 
ually increasing the voltage applied to the apparatus, one can 
determine very accurately the instant when this increase in electric 
current begins. This critical value of voltage gives the magnitude 
of the ionisation potential. The values of the first ionisation 
potentials of most chemical elements are graphically illustrated in 
Fig. 220. 

It is easily seen that the periodicity of this property completely 
conforms with that of the periodic table. It is most difficult to dislodge 
an electron from a helium atom and atoms of the other noble gases. 
This is precisely the reason for their chemical inertness. The uni- 
valent alkali metals have the lowest potentials. This, too, iscomplefe- 
ly understandable to the chemist, who is familiar with the excep- 
tional ability of such substances to enter into reactions. 

The values of the first and successive ionisation potentials are related 
to the valency of the atoms. Atoms of the alkali elements are univalent 
because one electron, which is on the outer ring of the atoms of these sub- 
stances, is more weakly bound than the rest of the electrons. The first 
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Fig. 220 


potentials of acaesium atom, for example, have the following values: 
3.9, 27, 46 and 62 volts. It is seen that the differences between the ener- 
gies necessary to dislodge the first and succeeding electrons are quite 
large. 


194. Atomic Spectra in the Optical Region 


Atomic absorption spectra as well as emission spectra may be ob- 
tained, but only the latter are of basic importance. Atomic spectra 
of emission in the optical region may be obtained by spectroscopic 
investigation of the radiation produced by gases and the vapours of 
bodies which are solid at normal temperatures. 

In order for atoms to radiate, they must be excited, i.e., made to 
pass from a lower energy level to a higher one. When atoms return 
to lower energy levels, an emission spectrum is produced. For every 
transition, there is a corresponding line in the spectrum. 

Atoms may be excited by various means. One method consists 
in the use of a gas discharge. The voltage applied to a gas-discharge 
tube accelerates the charged particles in the gas. These particles 
collide with neutral atoms, to which energy is transmitted by impact. 
Another method, which is used in the spectral analysis of metals, 
consists in the creation of arcs or sparks between two electrodes made 
of the material under investigation. Very high temperatures are 
produced in an arc or spark, resulting in the vaporisation of the sub- 
stance in the region of the discharge. The atoms are excited as the 
result of collisions. 


Sec. 194] Atomic Spectra in the Optical Region 501 


An atomic emission spectrum consists of a very large number of 
sharp lines. The radiation frequency corresponding to a given line 
satisfies the equation hv,,, =E,—E,,. Thus, by measuring the fre- 
quencies of radiated light, we can determine the differences in the 
energy levels of a given atom. One can reliably interpret atomic 
spectra, i. e., determine energy level patterns, from the values of 
radiation frequencies. Handbooks provide data on the spectral lines 
and energy levels of the chemical elements. 

It should not be supposed that a spectrum contains lines corre- 
sponding to all transitions from any one level to any other. Experi- 
ments have confirmed, and a theoretical basis has been provided 
for, the fact that certain selection rules exist. Certain transitions 
are forbidden, i.e., they do not exist. 

One cannot predict, of course, to which lower energy state an 
excited atom will pass, and what will be the frequency of the radiated 
spectral line. But not all transitions occur with equal probability. 
In principle, the probability of transition from one level to another 
may be theoretically calculated. The magnitude of this probability 
ie in the main, the intensity of the corresponding spectral 
ine. 

Atomic spectra are affected by external fields. If the substance 
under investigation is located in an electric or magnetic field, a 
number of its spectral lines split up into several components. The 
energy of a system having a magneticmoment M and located in an 
external magnetic field H is given by the expression MH (see p. 272). 
States which have the same quantum numbers n and / may differ 
from each other with respect to the projection of the magnetic moment 
on the direction line of the magnetic field. Therefore, the application 
of a magnetic field removes the degeneracy of energy levels and 
atomic electrons having different magnetic quantum numbers will 
have different energies. 

Investigations of atomic spectra of emission in the optical region 
are of great practical importance. Such a method of spectral analysis 
constitutes a very sensitive means of determining the chemical 
composition (up to 107*° gm) of substances, primarily alloys, and in 
a number of cases is more sensitive than chemical analysis. 

Optical frequencies usually arise with relatively weak excitation 
of an atom, i.e., when outer, valence electrons are transferred to 
a higher level. But even a very “high” electron can produce a broad 
spectrum. It would appear that the radiation frequency has no lower 
limit. Thus, the energy level diagram shows that as n increases the 
levels come closer together (Fig. 214 shows the levels and transitions 
for hydrogen, but in principle the patterns are the same for other 
atoms). This means that transitions corresponding to very low fre- 
quencies (long wavelengths) occur. However, experiments show that 
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spectra produced by outer electrons, even though they extend into 
the infrared region, do not include lines of very long wavelength. It 
must be concluded that the probability of a transition to some energy 
level such as the 21st is not large, and the probability of a transition 
from the 2lst to the 20th, for which a photon of low v would be ra- 
diated, is quite negligible. 

In the direction of high frequencies (short wavelengths), the fre- 
quency is limited by the ionisation potential. With respect to the 
“highest” electron, the potential of helium is the greatest and that 
of caesium the lowest, viz., 24v and 4v, respectively. This corre- 
sponds to radiation frequencies of 6 x 10'* cps (A=500 A) and 10** cps 
(1 =3,000 A), respectively. Thus, only a high-level electron can bring 
us into the region of very short ultraviolet wavelengths, which, 
relative to characteristic X-ray radiation, may also becalled a region 
of very long wavelengths. 

It is quite understandable that electrons of inner shells can be 
raised to high levels with strong excitation. In such a case, the char- 
acteristic spectrum includes X-rays. 


195. Atomic X-Ray Spectra 


In multi-electron atoms, the ionisation potentials of low levels 
reach high values. The excitation of such atoms may, therefore, re- 
sult in the radiation of X-rays (wavelengths of the order of 0.1-10A). 
An energy of the order of 10* ev must be imparted to an atom in order 
to produce X-ray radiation. This may be achieved in gas-discharge 
tubes by applying a voltage of tens of thousands of volts. 

One may calculate the value of the temperature at which an atom 
begins to radiate X-rays due to thermal collisions with other atoms. 
If the average kinetic energy per degree of freedom is to be of the or- 
der of 10* ev, the temperature must be of the order of 10° °K. Such 
high temperatures are achieved in solar and celestial atomic explo- 
sions (see p. 579). The X-ray radiation of the Sun may be determined 
by means of instruments placed in artificial Earth satellites. 

A practical method of obtaining X-rays is by the bombardment 
of a solid (the anti-cathode of an X-ray tube) with a stream of elec- 
trons. Electrons impinging on the anti-cathode are abruptly braked. 
As a result, a continuous spectrum of X-rays is obtained. The elec- 
tron energy, which has been increased to a value @, by acceleration 
in an electric field, decreases to a value @, as the result of braking. 
The energy difference @,—@,=hv is released in the form of radiation. 
@» may assume any value from @, to zero. Hence the radiation fre- 


quencies lie between the limits v=o and zero. The electron energy 
which does not go into radiation is transformed into heat. Only 
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about one-hundredth of the energy of the electron beam is trans- 
formed into X-ray energy. Evidently, the continuous X-ray spectrum 


has a short-wavelength limit Xp;,=—°-=34. Substituting the 
Max 


values of the constants, we obtain 
4. 123 


min V 2 


where i is expressed in Angstroms and V in kilovolts. Beginning at 
a very definite wavelength, the continuous X-ray spectrum increases 
in intensity with increasing wavelength, reaches a maximum several 
score Angstroms from the short-wavelength limit, and then slowly 
decreases in intensity. 

Investigations show that sharp lines, having a characteristic form 
for every element, are superimposed on the continuous spectrum. A 
characteristic X-ray spectrum arises owing to the fact that some of 
the electrons which impinge on the anti-cathode penetrate the atoms 
and dislodge inner electrons, i.e., electrons in the K, L, etc., shells. 
An X-ray quantum is produced when a high-level electron passes 
over to a vacated low-level position. The set of spectral lines due to 
electron transitions to the K level is called the K series, to the L 
level the L series, etc. If the voltage applied to an X-ray tube is 
increased, the series will appear in consecutive order because, as 
the energy of the electrons impinging on the anti-cathode is increased, 
more and more low-energy levels will be consecutively vacated and 
made available for transitions. The K series will be the last to appear. 

The general scheme of electron X-ray transitions is shown in 
Fig. 221, where heavy dots indicate initial levels. The most intense 
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lines are marked on the diagram. However, some transitions are miss- 
ing since they are forbidden by the selection rules. This pertains, 
for example, to transitions with the same value of azimuthal quantum 
number. 

Since the configuration of completed lower shells is the same for 
all atoms, X-ray spectra diagrams of different atoms are very similar 
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to each other. All spectra contain typical sequences of lines, which 
are systematically shifted along the wavelength scale in accordance 
with the atomic number of the elements. For example, all elements 
produce a strong a doublet (Ka, and Ka,) and a weaker B doublet. 
Quite often these doublets are unresolved. In such a case, one speaks 
of the a line and 6 line of a given element’s K series. These doublets 
are of a “spin” nature. 

Fig. 222 shows that a characteristic spectrum is gradually shifted 
into the short-wavelength region with increasing atomic number of 
the element giving rise to this spectrum. This law was discovered 
by Moseley. Its physical basis is the steady increase of the inter- 
action force between an electron and anucleus with increasing nuclear 
charge. The formula expressing this law will not be presented, but 
the systematic displacement of the lines is clearly evident from the 
figure. 


CHAPTER XXIX 


MOLECULES 


196. Chemical Bonds 


A molecule is a stable configuration of atoms. Every atom in a 
molecule occupies a stable position. The displacement of an atom in 
any direction results in an increase in the potential energy of the 
molecule. When an atom approaches a neighbouring atom there is 
a force of repulsion, and when it recedes a force of attraction. Every 
atom of a molecule, and the molecule as a whole, U 
is in a potential well. 

The form of the potential curve of an atom 
or molecule is quite evident (see Fig. 223). Since 
it is not possible to reduce the distance between 
atoms to zero, the curve of potential energy as 
a function of their separation rises sharply as 
this distance decreases. In the direction of in- 
creasing separation, the curve rises from the 
equilibrium position, i.e., the bottom of the 
well, much less sharply. Variations are possible: 
The potential energy at great distances may be 
more or less than at the bottom of the well and 
the well may have or may not have a clearly 
defined wall. The energy of a molecule may be 
more or less than the sum of the energies of its 
atoms. Accordingly, when atoms are combined 
in a molecule, heat is either released or absorbed Fig. 223 
(see p. 565). 

An atom in a potential well is bound to its neighbours. What is 
the reason for this bond? Do various types of bonds exist? Jonic 
and homopolar bonds are two ideal classifications of chemical bonds. 
In the overwhelming majority of cases of interest in chemistry, one 
of these two types of bonds, or an intermediate case in which both 
ideal types coexist, occurs. 

If an atom can transfer one or several electrons to another atom 
electrostatic attraction will occur between the ions which are formed. 
This is what is meant by an ionic bond. At a certain interatomic 
separation which is characteristic of this pair of ions, the forces of 
electrostatic attraction are counterbalanced by the repulsion of the 
electron clouds of the atoms. 
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If an atom is to transfer an electron to another atom, it is neces- 
sary that this process be advantageous from the energy viewpoint. In 
such a case, the simple tendency to pass over to the lowest energy 
level will result in the transfer of an electron. 

It was shown on p. 486 that an electron can be torn away from a 
neutral atom by an expenditure of energy equal to the product of 
an electron charge and the ionisation potential of the atom. Thus, 
the formation of a positive ion is always associated with an expendi- 
ture of work. On the other hand, the formation of a negative ion, 
i.e., the attachment of an electron to a neutral atom, is associated 
with a release of energy. To be sure, this applies only to the first 
electron. The attachment of a second electron to a singly charged 
negative atomic ion requires an expenditure of work to overcome the 
electrostatic repulsion. 

An ionic bond can exist if the energy of tearing away an electron, 
i.e., the work of creating a positive ion, is less than the sum of the 
energy released in the formation of negative ions and the energy due 
to electrostatic attraction between the ions. 

The alkali metals, in which the last electron is just beginning to 
form a new shell, have the lowest ionisation potentials. In the 
alkaline earth metals, each atom has two loosely bound electrons. 
It is evident that the formation of a positive ion from a neutral atom 
requires the least work when the electrons to be torn away are just 
beginning to form a new shell. 

On the other hand, it turns out that the most energy is released 
when an electron becomes attached to a halogen atom, in which the 
outer shell is one electron short of being complete. Therefore, in a 
large number of cases, an ionic bond is formed when a transfer of 
electrons resulting in the creation (in the formed ions) of closed 
electron shells, characteristic of atoms of the noble gases, occurs. 
In this way, the physical significance of potential wells in such 
molecules as NaCl and MgCl, may be easily explained. 

However, this explanation is not valid in all cases. For example, 
diatomic molecules of hydrogen, oxygen, etc., are not covered by 
this explanation. It cannot be assumed that in uniting one of these 
atoms is transformed into a negative ion and the other into a positive 
ion. Theoretical arguments need not be mustered. The physical pro- 
perties of molecules formed of ions indicate whether an ionic bond 
exists or not. Specifically, ionic compounds dissociate and form 
electrolytes. A large class of organic molecules do not behave in 
this manner. Therefore, for such substances, an ionic model is clearly 
not applicable. 

How can one explain the bond between atoms of such molecules? 
We must determine whether or not a gain in energy occurs when, say, 
two hydrogen atoms unite to form a molecule. 
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Such a gain does take place, and the conditions for its occurrence 
are given by quantum mechanics. As was stated on p. 477 the electron 
of a hydrogen atom behaves, in the main, like an electron in a po- 
tential well. The zero energy level of an electron in a potential well 
is determined by the dimensions of the well (see p. 479), i.e., the small- 
er its dimensions the greater the zero-point energy. Thus, any ex- 
pansion of the region in which the electron could move results in 
a decrease in energy. 

Now, imagine that two hydrogen atoms, which have one electron 
each, come into contact with each other. Since the Pauli exclusion 
principle allows two electrons to be in one state, the regions in 
which the electrons exist may merge and create a potential well of 
increased dimensions. This can occur only for two electrons having 
opposite spins. 

If a third atom approaches the hydrogen molecule which has 
formed, the argumentation used above is no longer applicable. The 
third electron cannot merge its region of motion with that of the 
electrons in the hydrogen molecule since this is not allowed by the 
Pauli exclusion principle: the vacant sites of the hydrogen molecule 
are occupied by two electrons of opposite spin. 

Thus, the second type of bond, a so-called homopolar bond, is 
provided by a pair of electrons of opposite spin. In the case of an 
ionic bond there is a transfer of electrons from one atom to another, 
but here the bond is achieved by joint action of the electrons, i.e., 
it is as if a common region of motion has been created. An expansion 
of the region in which an electron may move results in a decrease 
in energy and this is the reason for the formation of a potential 
well, This bond—the merging of the electronic clouds of electrons 
having opposite spins—is the main type of bond in organic mole- 
cules. 

Each atom is capable of forming a limited number of homopolar 
bonds. Two electrons of opposite spin, having a common “living 
space” in the form of the overlapping clouds of their wave functions, 
take part in the creation of each bond. 

As we know, s electrons have spherically symmetrical sp-func- 
tions, but the w-functions of p, d and f electrons extend in specific 
directions. Therefore, a homopolar bond between any two electrons, 
except s electrons, will be a directed bond. If a bond has formed 
between two atoms, the electronic clouds of these atoms assume a 
a definite orientation relative to the first bond line. Thus, only cer- 
tain specific angles are formed between bond lines emanating from 
these atoms. The values of such normal bond angles may be derived 
from quantum mechanics for all atoms. 

To acertain extent, both types of bonds are ideal. We frequently 
encounter cases in which the physical and chemical properties of 
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a molecule make it necessary to adopt an intermediate bond mecha- 
nism. In an ionic bond an electron is completely transferred from one 
atom to another and in a homopolar bond each electron belongs 
equally to both bound atoms, but in intermediate cases the electrons 
implementing a bond may spend more timenear oneof the atoms than 
the other. Such a model reflects, for example, the existence of an ionic 
bond in which the bond electrons belong most of the time to a neg- 
ative ion and the existence of a homopolar bond in which the bond 
electrons spend almost the same amount of time with each of the 
bound atoms. Intermediate bonds of any percentage of “ionocity” 
are possible. 


197. Geometries of Molecules 


A vast amount of data on spacings between the centres of 
atoms in molecules and crystals has been accumulated. Most of 
this data has been obtained by diffraction methods. If we do not in- 
sist on a very high degree of accuracy, it turns out that it is possible 
to represent molecules by models which give the shape and dimen- 
sions of the molecules. 

Models of molecules of the NaCl type, in which the atoms are joined 
by an ionic bond, are particularly simple. Each ion may be rep- 
resented by a sphere having a definite radius. The dimensions of 
-a number of ions are given in the following table: 


Jon | Lit | Na+ | K+ [as ber [ ci- | Br- | I- 


Jon radius, A | 0.60 | 0.95 | 1,33 | 1.69 | 1.36 | 1.81 | 1.95 | 2.16 


By means of such a table, we can determine the spacing between the 
centres of ions in any salt. For example, in NaCl it is equal to 0.95+- 
+1.81=2.76 A. 

But what is the significance of the assertion that an ion may be 
represented by a sphere? To show that such a representation is justi- 
fied, we must determine how closely to a molecule (say NaCl) another 
ion (sodium or chlorine) may approach. This is possible since exper- 
iments indicate that both fused and solid salts consist of ions. It 
turns out that a second and a third ion approach a given ion just 
as Closely as the first. Moreover, ions-the charges of which have the 
same sign may also approach each other to a distance equal to the 
sum of the ion radii. Thus, ions behave like spheres. 

An important conclusion regarding ionic molecules may be drawn 
from these geometric facts. Let us assume that a group of molecules 
are gathered closely around one of the molecules. The arrangement 
of ions is shown in Fig. 224. Since complete equality exists in in- 
teratomic spacings, it is no longer possible to say with which chlo- 
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rine neighbour of a given sodium ion, or with which sodium neigh- 
bour of a given chlorine ion, a molecule is formed. The concept of 
a molecule has lost its meaning. 

We must conclude from the geometric arrangement of atom centres 
that in a concentrated state, i.e., in a liquid or solid, where the 
atoms are linked by ionic bonds, mol- 
ecules do not exist as distinct forma- 
tions. The concept of a molecule turns 
out to be inapplicable. 

But what is thesituation in thecase 
of a gas? Upon vaporisation, a pair 
of ions of opposite charge, the net 
electric charge of which is equal to 
zero, is most easily torn away from 
a liquid. Therefore, basically, mole- 
cules of the NaCl type are to be found 
in vapours. However, in addition to 
molecules, ions are also vaporised from 
a liquid. 

The situation is entirely different in the case of molecules having 
homopolar, i.e., covalent, bonds. An analysis of interatomic spac- 
ings encountered in molecules shows that spacings between atom 

centres may be calculated by means of so-called atomic radii. The 
’ values of such radii in Angstroms for the most often encountered 
atoms are as follows: 


C— C= C= H— O—~ O= OF 
0.771 0.665 0.602 0.30 0.66 0.55 0.50 


Atomic radii decrease with increasing multiplicity of the valent 
bond. The table shows that the separation between two bound carbon 
atoms, C—C, is 1.54 A; the separation in C—H is 1.07 A, etc. 

In constructing a model of a molecule, we also have at our disposal 
certain elementary data on bond angles. The existence of normal bond 
angles is understandable from considerations of symmetry and is in 
agreement with certain qualitative quantum mechanical reasoning 
discussed in the preceding article. Thus, the normal bond angle of acar- 
bon atom that is linked to four atoms is a tetrahedral angle (109°28’). 
In thecase of an aromatic carbon atom, as well as other carbon atoms! 
that are linked to three atoms, the normal bond angle is equal to 
120°. Finally, the characteristic bond angle of a carbon atom that is 
linked to two atoms is 180°. 

The normal bond angles of oxygen, sulphur and nitrogen atoms 
which are linked to two atoms in the case of oxygen and sulphur 
and to three atoms in the case of nitrogen are equal to 90°. The ni- 
trogen atom in the nitro group NO, has a normal bond angle of 120°. 





Fig. 224 
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In a number of cases, bond angles deviate considerably from the 
“normal”. In certain cyclic compounds of the cyclobutane type, the 
angles are equal to 90° rather than 109°28’. Such deviations are due to 
spatial obstacles. However, before discussing this, we must clarify a 
third geometric characteristic of a molecule, namely, its inter- 
molecular radius. 

Investigations of molecular arrangements in crystals have shown 
that each atom may be assigned an intermolecular radius, such that 
on the average neighbouring molecules will touch each other. Thus, 





Fig. 225 


for example, the intermolecular radius of hydrogen is 1.17 A, oxygen— 
1.36 A, nitrogen—1.57 A, etc. This does not mean, however, that the 
distances between atoms of the same molecule which are not linked 
by valent bonds are determined by these values. The dimensions and 
the form of a molecule are determined by interaction between the 
forces establishing equilibrium distances between atoms which are 
not linked by valent bonds and the forces establishing normal bond 
angles. Since the bond forces between atoms are an order of magni- 
tude greater than the other forces, the interatomic distances do not 
change and the configuration of the molecule is determined by com- 
petition between the elasticity of a bond angle and the compressi- 
bility of the intermolecular sphere of an atom. 

Here is a simple, but graphic example. Experiments show that 
the bond angle in a molecule of water is equal to 105°. The distance 
between hydrogen atoms is 1.54 A. Therefore, considerable compres- 
sion of the intermolecular spheres of the hydrogen atoms occurs. 
This compression (2 x 1.17—1.54 =0.8 A) is balanced by the elasticity 
of the bond angle, the normal value of which is equal to 90°. Thus, 
the forces which compress the hydrogen atoms by 0.8 A are equal to 


Sec. 198] The Electronic Cloud of a Molecule 511 


the forces which change the angle from 90° to 105°. Such a simple 
mechanism explains the difference between the structure of a hy- 
drogen sulphide molecule and that of a water molecule. Since the 
length of the hydrogen-sulphur bond is considerably longer than that 
of the hydrogen-oxygen bond, the hydrogen atoms of the former 
molecule are considerably less “crowded”. It turns out that in hydro- 
gen sulphide the distance between hydrogen atoms is equal to 1.99 A 
and the bond angle is equal to 92°. The compression of hydrogen 
atoms by 0.35 A is balanced by a change of only 2° in the bond angle. 
Many organic molecules may be used to illustrate the validity of 
this mechanism. 

Fig. 225 shows the structure of a chlorobenzene molecule. To 
the left is a carbon atom; in the centre, the beginning of the build 


up, Viz., a C group; and to the right, a model of the molecule. 
CONG 


198. The Electronic Cloud of a Molecule 


Electron motion in a molecule, just as in an atom, is described 
by a wave function. Strictly speaking, the w-function is a function 
of 3n coordinates, where n is the number of electrons in the molecule. 
Then, w’ will give the probability of any electron distribution, i.e., 
the “electron density”. 

It has been noted already that the solution of the Schrédinger 
equation for multi-electron atoms is very complex. In the case of 
molecules, the difficulties are, of course, even greater. Only approx- 
imate, semiempirical methods of calculation are applicable here. 
In this connection, physical methods of determining electron density 
are of particular importance. However, even when such methods 
are used, the results are quite limited. 

The time-averaged electron density of a molecule is determined 
by means of X-ray structural analysis (the electron density gives 
the probability that the electrons are at a given location). As a 
result of the vibrations of atoms inside a molecule, and of the mole- 
cule as a whole, a photograph of an electronic cloud is smeared. Fig. 
167 (p. 386) shows a cross-section of the electron density pattern of 
an anthracene molecule. The coarseness of the method may be gauge 
from the fact that the hydrogen atoms of the molecule are not appar- 
ent in all cases. The method used in plotting the pattern is similar 
to that used in the construction of topographical maps. Electron 
peaks and valleys are indicated by lines connecting points with the 
same electron density. Each atom is represented by an electron den- 
sity “hill”. Superposition of the bell-shaped density functions of two 
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atoms along a bond line results in the formation of a “bridge” between 
the atoms. Unfortunately, the accuracy of the method is too poor 
to enable us to determine thenatureof the chemical bond by measur- 
ing the height of this bridge. Its height is indistinguishable from 
the sum of the density functions of two free atoms. But the specific 
nature of the chemical bond should probably be manifested in an 
additional increase in electron density (as compared with free elec- 
trons). Such electronic cloud patterns are, therefore, merely inter- 
esting illustrations of molecular structure. 

If the electron density with respect to the atomic nuclei of a mole- 
cule were known, we would be able to calculate the dipole moment 
of the molecule. For this purpose, it would be necessary to determine 
the centres of “gravity” of the positive and negative charges. The 
dipole moment has not yet been determined in this manner, although 
comparison of neutronographic (neutrons scattered on nuclei) and 
roentgenographic data could be used to solve such a problem. How- 
ever, the dipole moment of a molecule can be reliably measured 
(see p. 651) and it is then possible to solve the converse problem, 
namely, determine the centre of “gravity” of negative charge by 
means of the dipole moment. 

It would seem that in purely ionic molecules we encounter the 
extreme case in which the centre of gravity of an electronic cloud 
coincides with the centre of an anion. The dipole moment of KCI, for 
example, could then be predicted in the following manner. If one 
electron is taken from a potassium atom and transferred to a chlorine 
atom, one “extra” positive charge will be separated from one “extra” 
negative charge by the distance between the potassium and chlorine 
centres, i.e., 1.81+1.33=3.14 A. Hence, the dipole moment will 
be equal to 3.14x4.8x107'*=15 Gaussian units. But experiments 
yield a value of 6.8 Gaussian units. This means that even in the case 
of such a classical ionic bond the potassium electron does not go 
over completely to the anion. On the other hand, the other extreme 
case is fully realised. Evidently, symmetrical molecules such as Hg, 
O, and benzene cannot have a dipole moment: the centres of gravity 
of the electronic cloud and the nuclei coincide. 

One other property of an electronic cloud should be mentioned, 
“namely, its ability to be displaced relative to a nucleus. Electronic 
clouds may be displaced relative to nuclei by means of an electric 
field. Since nuclei are much heavier than electrons, it may be assumed 
that the nuclei remain fixed. The displacement of the electronic 
cloud of a molecule may be described by the displacement of its 
centre of gravity. When the centre of gravity of negative charge is 
displaced relative to thecentre of gravity of positive charge by a dis- 
tance x, the molecule acquires an induced dipole moment p=Nex, 
where NV is the number of electrons in the molecule. The induced 
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dipole moment increases linearly with the field, i.e., p=BE. It 
is customary to describe the displacement of the centre of gravity 
of an electronic cloud by 6, the magnitude of the polarisability of 
the molecule. The quantity 


B has the dimensions of vol- j-3 
ume. The greater the volume je2 
of the molecule, the greater Jeli oo 
the value of B (see Chapter 4&2 —————————u3 
XXXV). 5 ee era 
5) see 
199. Energy Levels of jp 
Molecules 74 
The energy of an atom j=2 


changes only by one means: a i j=], ——_—__—_ = 
change occurs in its electron 7 











motion, i.e., an electron J=3 

passes into another quantum j-2: —______ 

state. The energy of amolecule SEES 2? 
may also change in this man- j=0 u=0 
ner, but by other means as 

well. For example, the atoms 

of a molecule vibrate relative d3 

to one another. The vibration- je2 

al energy is an integral part je] see SS 

of the energy of a molecule #0 —————————u3 
and also may assume only a 43 

discrete set of values. Fur- j-2 

thermore, a molecule rotates ;, J=i 

as a whole. The rotational 49 — U2 
energy is also quantised and J-3 

a change in the state of a J=2 

molecule may result in a jeg J=/ we 
change in rotational energy. 

Therefore, the energy state of a J 

molecule is described by in- J=2 —______ 

dicating the state of its elec- oS ree e” 
tronic cloud (electron level), 4-9 u=0 
the state of its vibrational Fig. 226 


motion (vibrational level) and 
the state of its rotational motion (rotational level). We deal with 
three kinds of information—analogous, so to speak, to a house 
number, floor number and apartment number. 

But which is analogous to a floor number and which to an apartment 
number? Which energy levels are separated by large intervals and 
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which by small ones? The answers to these questions are contained 
in the energy level diagram shown in Fig. 226, which is based on 
experimental results and theory. Two electronic levels, e’ and e”, 
are shown in this figure. Associated with each electronic level is 
a group of vibrational levels designated by a set of v values, and 
associated with each vibrational level is a group of rotational levels 
designated by a set of j values. 

Clearly, the intervals between rotational levels are less than be- 
tween vibrational levels, and those between vibrational levels are 
less than between electronic levels. 

Let us assume that a molecule may have electronic levels at 100, 
200, 300,... energy units, vibrational levels at 10, 20, 30,... units, 
and rotational levels at 1, 2, 3,... units. In such a case, a molecule 
at the second electronic level, first vibrational level and third 
rotational level will have an energy of 213 units. 

Thus, the energy of a molecule may be given in the form 


V= Wy ae W vin +W jot: 


The frequency of radiated or absorbed light may always be deter- 
mined from the difference in energy between two levels, i.e., 


1 
v= h (AW ,,, i AW aip =e AW 7 o1)- 


It would be interesting to examine transitions involving a change 
in only one “kind” of energy. Practically, this is possibleonly for 
rotational transitions and it is easily seen why this is so. 

Let us investigate the absorption of electromagnetic waves by 
a group of molecules. Beginning with the longest wavelengths, i.e., 
smallest packets of energy Av, we find that the molecules do not 
absorb energy as long as the magnitude of a quantum of energy is 
less than the difference between two neighbouring levels. By grad- 
ually increasing the frequency, we eventually obtain quanta of 
energy that are capable of raising molecules from one “rotational” 
level to another. Experiments show that this occurs in the micro- 
wave region (at the end of the radio band) or, in other words, in the 
far infrared spectrum. It is found that wavelengths of the order of 
0.1-1 mm are absorbed by molecules. Thus, a pure rotational spec- 
trum may be obtained. 

By further increasing the frequency, we enable the rotational 
spectrum to become more developed, but nothing new occurs until 
the quanta of energy impinging on the substance are of sufficiently 
high frequency to make molecules pass from one vibrational level to 
arfother. It is clear, however, that a pure vibrational spectrum, i.e., 
a series of transitions for which the number of the rotational level 
does not change, is never obtained. Transitions from one vibrational 
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level to another involve various rotational levels. For example, a 
transition from the zero (lowest) vibrational level to the first may 
be accomplished by molecules from the fourth rotational level to 
the third, the third to the second, etc. Thus, there arises a vibration- 
rotational spectrum, which may be observed in infrared light (3- 
50 pw). Clearly, all transitions from one vibrational level to another 
are close to one another and yield a group of very close lines in the 
spectrum. For low resolution, these lines merge into one band. Each 
band corresponds to a definite vibrational transition. 

By increasing the frequency still further, we finally reach a new 
spectral region, which is characteristic of a molecule. This occurs 
in the optical and ultraviolet portion of the spectrum where the energy 
of a quantum suffices for the transition of a molecule from one elec- 
tronic level to another. Here, of course, neither pure electronic 
transitions nor pure electronic-vibrational transitions are possible. 
Electronic-rotational transitions, involving a change in “house”, 
“floor” and “apartment”, occur. Since a vibration-rotational transition 
gives rise to a band, the spectrum in the optical region is “striped”, 
i.e., it consists of a system of bands. 

Now, let us discuss the various types of molecular spectra in detail. 


200. The Rotational Spectrum of Molecules 


Free rotation of molecules occurs only in the gas state. Therefore, 
basic data on rotational energy levels are obtained by studying gas 
spectra. Investigation of these spectra by optical means is very dif- 
ficult. Much more suitable for this purpose is a radio-spectroscopic 
procedure that has been developed during recent years. A generator 
of electromagnetic waves transmits radiation through a waveguide* 
which is partially filled with the gas under investigation. After pass- 
ing through the gas, the electromagnetic waves arrive at a receiver 
which measures their intensity. This measurement may be performed 
over a large range of frequencies. The width of the band of frequen- 
cies generated by radio methods may be made so narrow that the 
resolving power becomes hundreds of thousands of times (!) greater 
than in the case of optical methods. Optical methods enable us to 
distinguish lines separated by0.1cm~’, but by radio methods we can 
distinguish lines separated by 10-° cm~'**. By means of this high 
resolving power, we are able to solve a number of interesting 


* A waveguide is a metallic duct of rectangular or circular cross-section 
through which centimetre radio waves may be propagated with practically no 
losses. ‘ 

** In spectroscopy, in addition to wavelength units, it is customary to use 
a reciprocal wavelength unit (wave number), i.e., the number of waves per 
centimetre. 
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problems which are discussed below. A rotational spectrurn arises as a 
result of the quantisation of a molecule’s kinetic energy of rotation: 


lw? 
K rot = o> 


where / is the moment of inertia of the molecule. This is the form of 
the expression for the energy of a diatomic molecule. This energy is 
described by a single moment of inertia taken about an axis perpen- 
dicular to the line joining the atoms and which passes through the 
centre of inertia. As was indicated earlier (p. 75), in the general case 
the rotation is described by three moments of inertia taken about 
three main axes. 

Briefly, let us consider the rotational spectra of diatomic mole- 
cules. 

First, it should be emphasised that not all molecules, including 
diatomic molecules, will yield a rotational spectrum of radiation or 
absorption. As has been explained already (see p. 319), every radiator 
or absorber of electromagnetic waves is a kind of oscillator, i.e., 
an elementary dipole. If the atomic motion of a molecule or the motion 
of a molecule as a whole is not accompanied by a change in dipole 
moment, such motion cannot result in theradiation or absorption of 
electromagnetic waves. 

When a molecule radiates or absorbs energy, its dipole moment p 
varies periodically as the oscillation frequency. The dipole moment 
oscillates about an average value, corresponding to the equilibrium 
positionof the atoms. It may be shown that the intensity of the spectral 

dp 


lines is proportional to the derivative (2) , i.e, the maximum 
r=0 


rate of change of the dipole moment with respect to interatomic 
spacing. All symmetrical molecules the atoms of which are joined by 
homopolar bonds have a constant zero value of p. Therefore, they do 
not give rise to rotational spectra. Such molecules include, for exam- 
ple, all diatomic molecules of the same atoms (Hg, Og, Ng, etc.). 

Let us consider the rotational spectrum of a diatomic polar mole- 
cule, i.e., a molecule possessing a dipole moment. The rotational 


energy of such a molecule is Kot = 3 here @ is the angular ve- 
locity of rotation and / the molecule’s moment of inertia: 


MMs 
m, +m, 


2 2 
= M1 ,0, + Mf, = r’, 





where r, and r, are the distances to the centre of inertia and r=r,+ To. 
The value of @ is determined from the fact that according to a rule 
of quantum mechanics (p. 487) the rotational momentum, /@, may 


Sec. 200] The Rotational Spectrum of Molecules 517 


assume only the discrete set of values 
A ane oe 
se VIG, 


where j=0, 1, 2,... is the quantum number designating the rotation- 
al levels. Therefore, the angular velocities of rotation of a molecule 
may assume only the following set of values: 


h Permrnas Tt 
0; = ap V I Get); 
hence, 


Ki = 2 = =n lt (j+1)}. 


Beginning with a zero energy of rotation, the energy of successive 
levels increases in accordance with a square law. 

Energy transitions are subject to a simple selection rule, i. e., 
only transitions between neighbouring levels are allowed (Fig. 227). 

The radiation or absorption frequency 
in the rotational spectrum of a diatomic 
molecule is given by 

hj 
v= ing i = 9, 1, 2,...) 

for a transition between the j and j—1 
levels. In this simple case, the rotation- 
al spectrum consists of a system of 
equally spaced lines. 

For different gas temperatures, the Fig. 227 
average energy of rotation of a molecule 
differs. In accordance with Boltzmann’s law, the most probable 


energy is given by I kE (two rotational degrees of freedom— 


see pp. 192-93). Thus, the number of the energy level at which a 
molecule is most frequently located may be easily calculated. For 
example, in the case of a tnolecule of a hydrochloric acid vapour 
(J =2.61 x 10~*° gm cm’), at temperatures of 300°, 600° and 1,200° kK, 
we obtain j=4, 6 and 8, respectively. 

Since transitions are possible only between neighbouring levels, 
a series of equally spaced frequencies will be grouped about the line 
of “average” j-value. The line intensity decreases as its distance from 
this j-value increases, since the number of molecules in the corre- 
sponding energy state decreases. 

Rotational spectra enable us to determine interatomic distances 
in simple molecules to a very high degree of accuracy (much ‘greater 
accuracy than by diffraction methods). Thus, if the number of atoms 
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in a molecule is notlarge, the distances between atoms may be deter- 
mined if the moment of inertia and the masses of the atoms are known. 
For a diatomic molecule, 


r= ye where m=— MyM 
> m,+m,° 


In the case of a hydrochloric acid molecule: 
my =1.67x10-% gm; mey = 85x 1.67 107% gm. 


The separation between the H and Cl atoms in an HCl molecule is 


= 2.61X107*X36X1.67X107—74 —s 
a yf BERTIE OGEXTSTXTO™™ _ 1.6310 cm. 


This value agrees closely with values obtained by other means. 


201. Infrared Vibration- Rotational Spectra 


This type of spectrum may be observed in a wavelength band 
extending from 2-3 to several score microns. For brevity, the vi- 
bration-rotational absorption spectrum is referred to as the “infrared 
spectrum”. In the case of solids, where there is no molecular rotation, 
a pure vibrational spectrum is obtained. In the case of liquids, where 
rotation is impeded, the rotational structure of the band is smeared. 

Diatomic Molecules. Let us disregard rotation for the present 
and consider vibrational energy levels. 

The vibration of a diatomic molecule may be visualised by means 
of a simple model consisting of two spheres joined by a spring. In 
such a system, the natural frequency of oscillations is given by 


EVE. 


where k is the stiffness coefficient determining the binding force and 
m is the mass of an atom when the atoms in the molecule are the same; 
when the masses differ, m is the reduced mass, which is equal to 


i's _ (we leave the proof of this to the reader). Quantum mechanics 
shows that the energy of an oscillator is given by the formula 
1 
C= (v - x) hv 
Here, + hy is the zero-point energy of the oscillator, i.e., the oscilla- 
tion energy at absolute zero, and v—0, 1, 2,... is the oscillation quan- 
tum number. Moreover, it is shown in quantum mechanics that in 
the case of harmonic oscillators energy transitions may occur only 


between neighbouring levels. In the case of nonharmonic oscillators 
transitions skipping one level or more occur, but these are 
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weaker than the main transitions. Harmonic oscillations occur 
under the action of a restoring forcee—kx. The potential energy of 


2 
such oscillations is, i.e., the shape of the curve is parabolic. 

Fig. 228 shows a potential curve (and an inscribed parabola) for 
a diatomic molecule. The horizontal lines represent energy levels 
based on theoretical calculations. For low values of energy, the devi- 
ation of the potential curve from 
a parabola is negligible. Such a 
molecule may be expected to 
obey the harmonic oscillator law 
as long as the vibrational ener- 
gy is much less than the dis- 
sociation energy of the mole- 
cule. Under such conditions, the 
vibrational levels may beconsid- 
ered to be equally spaced, and 
since only transitions between 
neighbouring levels are allowed, 
the diatomic molecule will pos- 
sess a single transition frequency. If there is no molecular rotation, 
the entire spectrum will consist of a single line. Actually, in 
addition to the main frequency v, the spectrum contains the 
“overtone” frequencies 2v, 3v, etc. (as the separation between 
levels decreases, the proportional trend of the overtone frequencies 
is lost). However, the overtones are weak and in very many cases 
we have a right to speak of a single vibration frequency. 

The presence of molecular rotation will transform such a spectral 
line into a band. If a molecule vibrates and rotates simfultaneously, 
its energy is determined by the two quantum numbers v and j: 











é= (v +5) hVgip tye i I+ 1). 


The frequencies obtained now fall into two groups, one less than 
and one more than the vibration frequency v,;,. These groups are 
known as branches and are designated by the letters R and P. 
Taking into account the selection rules discussed above, we obtain 
the following frequency formula: 


Nhe 275 
V= Vota I (j=1, 2,...). 


The plus sign corresponds to transitions to higher rotational leveis 
and the minus sign to lower rotational levels. 
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This is shown in Fig. 229, which illustrates the spectral band of 
HCl. The point O corresponds to v,,,, and the vertical lines to the 
right and to the left indicate the obtained frequencies. The height of 
a line is proportional to the intensity at the given frequency. When 

R the resolution is high, 
each line appears dis- 
tinct. On the other hand, 
when the resolution is 
low, the lines merge into 
a band the intensity de- 





T= 100°K 


gr O pendence .of which is 

7=500°% f given by the envelope 
of the spectral lines. In 

Fig. 230, we see a dia- 

R 0 p gram of the energy tran- 


o this band. It should be 

noted that a pure vibra- 

Fig. 229 tional transition (from 

j’=0 to j” =0) is forbid- 

den and as a result there is a gap in the middle of the band. There 

is an absorption maximum to the right as well as to the left of the 

vibration frequency. For the reason discussed in the preceding ar- 

ticle, the absorption maxima occur 

for the j-values which are most 

frequently encountered at the given 

temperature. Therefore, as the tem- 

perature increases, the shape of the 

spectral bard changes as shown in 
the diagram. 

Vibrations of a Polyatomic Mol- 
ecule. A polyatomic molecule may 
execute a large number of vibra- 
tional motions. This number is 
equal to the number of vibrational 
degrees of freedom of the molecule 
and may be calculated as follows. ay 
A molecule consisting of N atoms j6§439210,12345676 170 
has 3N degrees of freedom. Three LELititrt it i tt 





of them are associated with the Fe wih DP 
coordinates of the molecule’s centre Fig. 230 


of mass. In the general case, the 
number of rotational degrees of 
freedom is also equal to three. But linear molecules have only two 
rotational degrees of freedom since rotation about a line passing 
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through thecentres of the atoms is physically meaningless. Thus, the 
number of vibrational degrees of freedom and, hence, the number of 
vibration frequencies is equal to 3V—6 or 3N—5. If the dipole mo- 
ment of the molecule does not change for a given vibration, the cor- 
responding frequency will not be manifested. (We shall return to the 
problem of so-called inactive vibrations later.) Be that as it may, the 
number of vibration frequencies and, hence, the number of bands 
in the infrared spectrum, is strictly determined by the number of 
atoms in the molecule and by its symmetry. 

In the absence of molecular rotation, i.e., in the case of solids, 
an infrared absorption spectrum consists of lines which correspond 
to vibrational transitions. Since, perforce, thick layers are used in 
such an investigation, there is considerable absorption under normal 
conditions and the lines merge into a band. In liquids, molecular 
rotation is retarded and the rotational structure of such a band will 
be smeared, i.e., individual lines can no longer be detected. 

Now, let us consider the physical meaning of vibrations in a poly- 
atomic molecule. Actually, what kind of vibrations occur? In the 
case of a diatomic molecule the situation was clear, i.e., we were 
dealing with vibrations along a bond line. What quantities vibrate 
harmonically in polyatomic molecules? 

For any molecular vibration, the deviations of atoms from their 
equilibrium positions may be described by displacements along a 
bond and by the distortion of bond angles. The instantaneous con- 
figuration of a vibrating molecule may be completely described by 
(83N—6) g; coordinates (using the word coordinate in its broad sense). 
For an arbitrary choice of the g; coordinates, their values will not 
obey a simple vibration law. The law of change with respect to time 
of each g; can be represented by a complex, albeit periodic, curve. 
However, it turns out that it is possible to describe a vibrating 
molecule by (3V—6) Q; numbers which vary harmonically with 
frequencies v; These Q; “coordinates” are called normal coordi- 
nates and the frequencies v; are called normal vibration frequen- 
cies. 

The fact that it is possible to introduce normal coordinates means 
that the periodic curves of change of any gq; coordinates may be 
resolved into spectra of normal vibration frequencies. We can al- 
ways assume that a vibration spectrum consists of normal vibration 
frequencies. 

What is the nature of Q; coordinates? Are they obtained only for 
a particular choice of coordinate system? The answer to the latter 
question is no. First and foremost, normal coordinates are linear 
combinations of g,; displacements. Therefore, a normal coordinate 
describes the vibration of a molecule as a whole. Examples of normal 
vibrations are illustrated in Figs 231 and 232 for CO, and H,O 
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molecules. The actual vibration of a molecule is the resultant of the 
indicated motions. 

The normal vibration frequencies of a molecule can be determined 
from its spectrum. These can then be used to obtain a clear picture 
of the molecular vibrations. 

The characteristic natureof many 0 
vibration frequencies is of eet * 


ae oy 
0 No # 





b) oe . ie 
b) ee yf mS 





Fig. 231 Fig. 232 


practical importance. Careful study has shown that basically in 
certain normal vibrations only one interatomic spacing or one bond 
angle varies. If a molecule preserves that bond, such a frequency 
varies little in a group of related compounds. This fact is utilised in 
chemistry. 

The vibration frequencies of a molecule are measured not only by 
means of infrared absorption spectra but by means of Raman spectra 
as well. As will be seen below, these two methods effectively supple- 
ment each other. 


202. Raman Scattering of Light 


Raman scattering refers to the particular case of the scattering of 
light of frequency v by a substance when, in addition to the strong 
scattering of light of constant frequency v, there appears a series of 
lines of lower and higher frequencies. 

Usually, observations are made at right angles to the incident light. 
A mercury lamp provides the required radiation. The spectrum of 
this radiation contains several intense lines, the most important of 
which is a blue line corresponding to a wavelength of 4,358 A. By 
means of a spectrograph, one can obtain a photograph of the scattered 
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radiation spectrum. Such a photograph is shown in Fig. 233. The 
main characteristic of such a spectrum is the following. About each 
excited line there appear identical groups of considerably weaker 
lines. These satellites are usually spaced symmetrically to the right 
and to the left, but they may differ in intensity. This phenomenon 
was discovered independently by Raman in India and Landsberg 
and Mandelshtam in the Soviet Union. Raman’s work, however, was 
published first*. Hence, such spectra 
are called Raman scattering spectra. 

The spectral pattern can be ex- 
plained as follows. Basically, a photon 
hv is scattered by a molecule elasti- 
cally, i.e., the frequency remains 
constant. However, in addition to 
such scattering, it is also possible ; 
to have scattering with some loss of Fig. 233 
energy; such energy may be expended 
in the transition of a molecule from one level to another. 
Let us assume that a photon Av has lost an amount of energy equal 
to that required to raise a molecule from the zero vibrational level 
to the first level. The energy loss is @s=1—@v=0 =AV,ip. There- 
fore, the scattered photon has an energy A(v—v,;,). An associated 
line or “satellite” appears in the spectrum on the side of lower fre- 
quenicies. 

The lower frequencies, v—v,;,, are called red satellites and the 
higher frequencies violet satellites. Scattering with a frequency 
greater than v occurs when a photon hits an excited molecule. In 
such a case, the photon is scattered, but it simultaneously gains the 
“extra” energy due to the transition of the molecule to a lower level. 
If the excited molecule was at the first vibrational level, the photon 
increases its energy by hv,;, and the frequency v-+-v,,, appears in 
the spectrum. 

This scattering mechanism excellently explains the difference 
between the intensities of the red and violet lines. At room tempera- 


: 1 
ture, most molecules are at the zero level with an energy = Wow: 
A smaller number of molecules are at the first excited level with an 
energy > hvyip. Therefore, it is clear that the intensity of the violet’ 


lines must be less. Moreover, at low temperatures the violet lines 
practically disappear. The ratio of the intensities of the violet lines 
to the red lines is proportional to the ratio of the number of atoms 
in the first state to the number in the zero state. According to the 








* Raman sent a telegram about his discovery to the British journal Nature. 
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Boltzmann law, 
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This formula agrees excellently with experimental results. 

Thus, the displaced lines of a Raman spectrum are shifted by 
amounts equal (in energy units) to the difference between the 
energy levels of the given molecule. 

We have discussed the case of two vibrational levels, but it is 
clear that the discussion is valid for all energy transitions—pure 
rotational, vibration-rotational, etc. The satellite lines closest to 
the main scattering line correspond to the lowest energy transitions. 
The rotational spectrum is located considerably closer to the main 
line than the vibration-rotational spectrum. 

In the case of absorption spectra, the selection rules for the oscilla- 
tion quantum number are the same as those for Raman scattering, 
but the selection rules for the rotational quantum number are differ- 
ent. Transitions are allowed for which 


Av=-+1 and Aj=0,+2. 


Thus, the vibration-rotational band consists of a pure vibrational 
line displaced from the excited line by v,;, and a series of lines 
displaced from the excited line by v,;,—2v,,; and v,;,+2v,o1- 

Raman spectra are usually obtained by scattering from liquids. 
The lines v,;,+2v,., appear smeared, but the lines of the pure vi- 
bration spectrum are distinct. 

Raman spectra have an important advantage over infrared spec- 
tra. The measurements are transferred, so to speak, to the visible 
region. The frequencies which were measured directly in the infrared 
spectrum are determined as the difference between the main line and 
the Raman line with approximately the same accuracy. 

It would seem that one could dispense with the infrared spectrum. 
However, this is not always the case. In certain respects, the infrared 
and Raman spectra supplement each other. 

What is the difference between the process of wave radiation from 
a molecule and the process of scattering from a molecule? In both 
cases a molecule sends wavelets into space, i.e., in both cases a mol- 
ecule behaves during radiation like a dipole. However, in the first 
case a molecule behaves like a dipole in the absence of an external 
field, while in the second it behaves like a dipole when acted upon 
by the field of an incident wave. Thus, radiation or absorption will 
occur when changes in the state of a molecule (vibration, rotation, 
etc.) are accompanied by changes in the induced dipole moment, 
i.e., in the polarisability B. Theoretical calculations indicate that 
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such a change should occur when the configuration of the molecule 
passes through equilibrium. 
Lines will occur in an infrared spectrum for vibrations which sat- 


isly the condition 
dp -y 
(i) an, 


Raman lines will occur for vibrations which satisfy the condition 


eure 


Quite often these conditions exclude each other. Therefore a certain 
vibration may be active in the infrared spectrum but inactive in 
the Raman spectrum, and vice versa. 

A CO, molecule may serve as an example. One of such a molecule’s 
three vibrations—the linearly symmetrical vibration—leaves the 
dipole moment unaltered and equal to zero. That vibration is inac- 
tive in the infrared spectrum. In the Raman spectrum, on the other 
hand, only that vibration will be active; the other two will be absent. 
In the case of an anti-symmetrical vibration, one may reason as 
follows: In both extreme positions, the deformation of the electron 
cloud, and hence the polarisability, is the same. During vibration 
the polarisability changes in the same manner in both half-periods, 
and at the equilibrium position passes through a minimum or max- 
imum, but this does not mean that 


We shall not discuss these regularities any further. They have 
been studied in detail and the results are available in tabular form. 
Such tables enable us to determine from the symmetry of a molecule 
the number of vibration frequencies in its infrared and Raman spectra. 
The converse is also true, namely, the symmetry of a molecule can 
be determined from the number of lines in its spectra. 


203. Electron Spectrum 


Let us consider absorption spectra in which the transitions are in 
the visible and ultraviolet regions. The magnitude of a photon of 
the incident light will be of the same order of magnitude as the differ- 
ence between the electronic levels of a molecule. Electron transi- 
tions become possible. However, as has been indicated already, elec- 
tron transitions are accompanied by changes in vibrational and 
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rotational energy. Therefore, a very broad band is associated with 
every such transition. Moreover, under normal experimental condi- 
tions this band is continuous, i.e., its “vibration-rotational” struc- 
ture is not discernible. Each electron transition band contains 
numerous narrow vibration-rotational transition bands, whereby all 
changes of the oscillation quantum numbers are possible. 

The properties of a molecule in an excited electronic state differ 
from its properties when it is at a zero electronic level. When a 
molecule becomes excited, the system of vibrational and rotational 
levels and hence the vibration frequencies, i.e., the differences 
between the vibrational levels, change. Also, the shape of the poten- 
tial curve and the equilibrium spacings between atoms change. 

Absorption curves in the visible and ultraviolet regions: are suf- 
ficiently characteristic to be used in identifying substances. 

The dependence of the absorption of light on the thickness of a 
layer of substance may be expressed as follows (cf. p. 119): 


|e We 


where /, is the intensity of the incident beam, / the intensity of the 
transmitted beam, x the thickness of the layer, and uw the absorption 
coefficient for light. The value of the absorption coefficient depends on 


the wavelength of the incident light. A curve of + as a function of 


wavelength is sometimes called an absorption’ curve. Usually, 
however, this term refers to a curve of p as a function of A or v. 
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Fig. 234 
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The relation for the absorption of light in solutions may be written 


in the form 
p= Leo? or [= bet, 


where / is the distance traversed by the beam in the substance 
and kN and ec are expressions for the absorption coefficient of the 
solution. It is quite reasonable to assume that the absorption coef- 
ficient is proportional to the concentration of the substance, which 
may be expressed as the number of molecules N per unit volume or 
the number of moles of substance ¢ per litre of solution. In the case 
of solutions, the term “absorption curve” usually refers to a curve 
showing the dependence of the coefficient & or e on A. 

Examples of absorption curves in the visible and ultraviolet spec- 
trum are shown in Fig. 234. Curve / is for Congo red, 2—for aniline, 
3—for phenol and 4—for benzene. 


CHAPTER XXX 
ATOMIC NUCLEI 


204. Experimental Methods of Nuclear Physics 


Investigation of the structure of atomic nuclei is inseparably linked 
with the study of spontaneous and induced decay of atomic nuclei 
and nuclear particles. By studying the fragments of a disintegrat- 
ing atomic nucleus and tracing the fate of these particles, we can 
make certain inferences about the structure of the nucleus and about 
the nuclear forces. 

It is not surprising that the spontaneous decay of nuclei, i.e., 
natural radioactivity, was the first to be studied in detail. Concurrent- 
ly, physicists began to study cosmic rays—radiation from outer space 
possessing extraordinary penetrating power. In interacting with mat- 
ter, cosmic particles behave like projectiles. For a long time, cosmic- 
ray investigations were the primary means of studying transforma- 
tions of elementary particles and to a certain extent of studying 
atomic nuclei. At present, streams of particles created in accelerators 
are the primary means of studying the disintegration of atomic nuclei. 

The experimental methods to be discussed below are applicable to 
the study of cosmic rays and particles created during the nuclear bom- 
bardment of one or another target. 

Wilson Cloud Chamber. When a fast particle passes through a cham- 
ber containing supersaturated vapour and creates ions along its path, 
a track is left that is very similar to the “tail” occasionally seen in the 
sky after a plane has passed. This track is produced by condensation 
of the vapour. The ions along the path of the particle are centres of 
vapour condensation; as aresult the track is easily detected. The track 
of the particle may be photographed or observed directly. 

The state of the vapour in thechamber may be controlled by vary- 
ing the volume of the chamber. This is achieved by means of a piston. 
Rapid adiabatic expansion brings the vapour to a state of supersatu- 
ration. 

If a Wilson cloud chamber is located in a magnetic field, the veloc- 
ity of a particle in the chamber may be determined from the curva- 
ture of its path if e/m is known or, conversely, e/mmay be determined 
if the velocity is known (see the formula on p. 442). 

lonisation Counters and Jonisation Chambers. An ionisation de- 
vice used in radiation investigations usually consists of a cylindri- 
cal condenser filled with gas. The cylindrical casing constitutes one 
electrode and a thin insulated wire placed along the axis of the cylin- 


“Sec. 204] Experimental Methods of Nuclear Physics 529 


‘der the other (Fig. 235). The magnitude of the voltage applied to the 
condenser and the pressure of the gas in the counter must be chosen in 
‘a special manner, depending on the nature of the problem. In one 
such widely used device, known as a Geiger counter, a voltage equal 
to the breakdown voltage is applied between the cylinder and the 
wire. When an ionising particle enters such a counter through the 
wall or insulator, there passes through 
‘the condenser a pulse of current which 
continues to flow until the primary elec- 
trons and the created electrons and ions 
of the self-maintained discharge reach 
‘the positive casing of the condenser. 
‘This pulse of current can be amplified 
by ordinary radio-engineering means and 
‘the passage of ‘the-particle through the 
counter may be determined by: a click 
or- light flash, or by means of a digital 
counter. ; y os etn Ac 
---A digital counter can be used to deter- 
mine the number of particles entering 
the instrument. This requires that the 
pulse of current due to one particle cease 
by the time the next particle enters the 
counter. If the operating conditions of 
the counter are not chosen properly, the 
counter begins to “choke” and count 
incorrectly: The resolving power of an ionisation counter has an 
upper limit. This limit, however, is quite high, i.e., up to 10,000 
particles per second may be counted. ; 
- If we decrease the voltage, the pulse of current passing through 
_the condenser may be made proportional to the number of created 
fons (proportional counter). For this purpose, it is necessary to oper- 
ate in a region where the gas discharge is not self-maintained. Pri- 
mary electrons moving in the electric field of the condenser accumu- 
late energy, whereupon ionisation by collision commences and new 
ions and electrons are produced. The first n pairs of ions produced 
by a particle entering the counter are transformed into kn pairs of 
ions. When the operating conditions are such that the discharge ise 
not self-maintained, the amplification factor k is constant. Thus, a 
proportional counter not only establishes that a particle has passed 
through the counter, but also measures the ionising power of the par- 
ticle. 

Just like in the Geiger counter described above, discharge in a 
proportional counter ceases when no more ionisation takes place. The 
distinguishing feature of a Geiger counter is that a particle entering 





Fig. 235 
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such a counter behaves like a trigger mechanism, and the breakdown 
time is independent of the primary ionisation. 

Since a proportional counter is sensitive to the ionising power of 
a particle, the operating conditions of the counter may be chosen 
so that only certain kinds of particles are recorded by the instrument. 

If the operating conditions of the instrument correspond ‘to satura- 
tion current (achieved by decreasing the voltage), the current through 
the counter becomes a measure of the radiation energy absorbed 
in the instrument per unit time. In such a case, the device is called 
an ionisation chamber. The amplification factor k is then equal to 
unity. The merit of an ionisation chamber is its high stability of 
operation. The design of an ionisation chamber varies considerably 
from case to case. The choice of chamber filling, wall material and 
number and shape of electrodes will depend on the purpose of the in- 
vestigation. Chambers vary in size from about a cubic millimetre 
to several hundred cubic metres. Under the action of a constant 
source of ionisation, currents ranging from 10~*’ to 10~” ampere are 
produced in ionisation chambers. 

Scintillation Counters. The method of counting elementary par- 
ticles by the flashes of a fluorescent substance (scintillations) was 
first used by Rutherford in his classical investigations of the struc- 
ture of atomic nuclei. Modern instruments bear little resemblance 
to the simple device used by Rutherford. 

A particle impinging on a phosphor* may produce a flash of light. 
There exist a large number of organic and inorganic substances which 
are capable of transforming the energy of charged particles and pho- 
tons into luminous energy. The duration of afterglow in many phos- 
phors is very short— of the order of athousand millionth of a second. 
This makes it possible to construct scintillation counters with large 
counting rates. The light yield of a number of phosphors is propor- 
tional to the energy of the incident particles. Such phosphors find 
application in the construction of counters for the determination of 
the energy of particles. 

In modern counters phosphors are combined with photomultipli- 
ers having ordinary photocathodes that aresensitive to visible light. 
The electric current produced in a photomultiplier is amplified and 
fed to a counting device. 

The most widely used organic phosphors include anthracene, stil- 
bene and terphenyl. These chemical compounds belong to the class 
of so-called aromatic compounds, which contain rings of six carbon 
atoms. For use as scintillators, these substances must be obtained in 
the form of monocrystals. Since large crystals are rather difficult to 


* Phocphors constitute a large group of solids which, generally speaking, 
do not have anything in common with the chemical element phosphorus. 
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grow and since crystals of organic compounds are very fragile, the 
use of plastic scintillators, i.e., solid solutions of organic phosphors 
in transparent plastics such as polystyrene and other such high-poly- 
mer materials, is of considerable interest. Of the inorganic phos- 
phors, halides of alkali metals, zinc sulphide and tungstates of alkali 
earth metals find application. 

Cherenkov Counters. As far back as 1934, Cherenkov showed that 
when a fast particle moves in a perfectly pure liquid or solid dielec- 
tric a peculiar luminescence occurs. This luminescence basically dif- 
fers from fluorescence, which is related to energy transitions in atoms 
of the substance, and from bremsstrahlung of the continuous X-ray 
spectrum type. Cherenkov radiation occurs when a charged particle 
moves with a velocity exceeding the phase velocity of the propaga- 
tion of light in the dielectric. The distinguishing feature of this radia- 
tion is that it is propagated along a conical surface in the propaga- 
tion direction of the particle. The angle of the cone may be deter- 
mined from the formula 

v 

cos =p 

where § is the angle between the surface of the cone and the direc- 
tion of motion of the particle, V is the velocity of the particle, and 
v is the velocity of light in the medium. Thus, for a medium having 
a given index of refraction n, there exists a critical velocity, V= 


= vas, below which no radiation occurs. At this critical velocity, 


the radiation is parallel to the direction of motion of the particle. 
For a particle moving with a velocity very close to the velocity of 


light (V=c), the angle of radiation ( =arccos + is a maximum. In 


the case of cyclic hexane, n=1.437 and § =46°. 

Theoretical calculations and experiments show that, in the main, 
the Cherenkov radiation spectrum is located in the visible region. 

Cherenkov radiation is a phenomenon similar to the formation of 
a nose wave near a moving ship; in such a case, the velocity of the 
ship is greater than the velocity of the waves on the surface of the 
water. 

Fig. 236 illustrates how Cherenkov radiation is formed. A charged 
particle moves along the axial line and the electromagnetic wave 
following the particle temporarily polarises the medium at points of 
the particle path. All such points become sources of spherical waves. 
There exists a certain angle for which these spherical waves coincide 
in phase and form a single front. 

Let us consider two points along the path of the charged particle 
(Fig. 237). Two spherical waves have been created—one at the in- 
stant of time ¢ and the other at the instant of time f-+17. Clearly, t is 
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the time required for the particle to traverse the distance between 
these two points. In order for these two waves to be propagated at an 
angle @ in the same phase, the time of travel of the first beam must be 
greater than the time of travel of the second by t. During the time 
t, the particle traverses a distance Vt and the wave a distance ot. 


Thus, we obtain the formula cos t=> F 


Cherenkov radiation is widely used as a means of registering ele- 
mentary particles. Counters based on this phenomenon are known as 
Cherenkov counters. Like scin- 
tillation counters, they contain 
a luminescent material, photo- 
multipliers and amplifiers of 
photoelectric current. Various 
types of Cherenkov counters have 
been designed. 

Such counters have numerous 
merits. These include a_ high 
counting rate and the ability to 
determine the charge of particles 
moving with velocities very close 
to that of light (the curve of light 
yield as a function of particle 

vt charge rises sharply). Only by 
6 means of Cherenkov counters is 
it possible to solve such impor- 
Vr tant problems as the direct de- 
; termination of the velocity of a 
Fig. 237 charged particle, the determina- 
tion of the direction in which 
an extremely fast particle moves, etc. 

_ Arrangement of Counters. To study transformation and interac- 
tion processes of elementary particles, one must be able to detect the 
emergence of a particle at a particular point and trace its subsequent 
course. Various arrangements of standard counting circuits are used 
in solving such problems. For example, two or more counters may be 
electrically connected in such amanner that a count occurs only when 
a discharge begins in all the counters at exactly the same time. 
This may serve to indicate that a certain particle has passed through 
all the counters. Such an arrangement of counters is known as a 
“coincidence” arrangement. 

Nuclear-Emulsion Method. A gelatinous film containing micro- 
crystals of silver bromide may serve as the photosensitive layer of 
a photoplate. Basically, the photographic process consists in the ion- 
isation of these crystals, resulting in the reduction of the silver bro- 


Fig. 236 
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mide. This process occurs under the action of light or of charged par- 
ticles. A concealed track is formed in the emulsion when a charged 
particle passes through it. This track may be seen after the photo- 
plate is developed. Photoemulsion tracks tell us a great deal about the 
particles producing them. Thus, strongly ionising particles leave heavy 
tracks. Moreover, since the ionisation produced depends on the 
charge and the velocity of the particle, considerable information may 
be obtained by simply examining the appearance of the track. The 
range of a particle in the photoemulsion is also a source of valuable 
information. By measuring the length of the track, one may deter- 
mine the energy of the particle. 

Ordinary photoplates, having thin layers of emulsion, are hardly 
suitable for nuclear investigations. Such plates would register only 
those particles the motion of which is strictly in the plane of the 
plate. Misovsky and Zhdanov in the Soviet Union, and several years 
later Powell in Britain, introduced the use of photoplates having an 
emulsion thickness of almost 1 mm (one hundred times greater than 
the thickness in ordinary plates). The complex transformations 
occurring when a particle disintegrates are registered in clear visual 
form in the photographic method. 

Fig. 238 shows a typical photograph obtained by this method. 
Nuclear transformations have occurred at P and S. 

In a recent modification of this method, an emulsion chamber of 
considerable volume is used as the medium for registering the parti- 
cle track. 

Methods of Analysing Observations. Using the described instru- 
ments, an investigator is able to determine the most important con- 
stants of an elementary particle, viz., velocity, energy, electric 
charge and mass. All-these parameters may be determined very accu- 
rately. Moreover, when a stream of particles is available, it is possible 
to determine the spin and magnetic moment of an elementary par- 
ticle. For this purpose, a magnetic field is used to divide the beam 
(see p. 492). 

It should be recalled that only charged particles can be observed 
directly. Our knowledge of neutral particles and photons is ob- 
tained indirectly, i.e., by determining how these invisible particles 
aflect charged particles. Nevertheless, our knowledge of invisible 
particles is highly reliable. 

The laws of conservation of momentum and energy have wide ap 
plication in the investigation of elementary particle transformations. 
Since we are dealing with fast particles, changes in mass must be 
taken into account in applying the law of conservation of energy. 

- Consider a particle track in which “branching” occurs. This signi- 
fies that the first particle has been transformed into two particles. 
Therefore, the following relations must hold. First, the momentum of 
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the first particle must be equal to the vector sum of the momenta of 
the created particles: | 


P.=P.+Ps- 
K,=K, +K,+A6, 


where AG =c*Am (the increment Am is the difference in mass between 
m, +m, and m,). 

‘Nuclear physics experiments show that the laws of conservation 
are strictly obeyed in all elementary particle transformations. Hence, 
these laws may be used to explain the properties of neutral particles, 
which do not leave tracks in a photographic emulsion and do not 
ionise a gas. When an investigator observes two diverging tracks on 
a photoplate, he knows that at the branching point a neutral parti- 
cle transformation has occurred. By determining the momentum, 
energy and mass of the created particles, he may reliably ascertain 
‘the values of the parameters of the neutral particle. This is how the 
neutron was discovered and how neutrinos and neutral mesons are 

studied (see sili 


Secondly, 


205. Nuclear Particles 


Atomic nuclei consist of protons and neutrons. The basic ee en 
istics of a proton, like any other elementary particle, are its charge, 

mass, spin and magnetic moment. A proton has a positive elementary 
electric charge, i.e., its charge is equal in magnitude, but opposite 
in sign, to the charge of an electron. Its mass is equal to 1.6724x 
x 10-** gm, which is 1,836 times greater than the mass of an electron. 


It has a spin of ; and a magnetic moment of 1.41 lor" Gaussian 


unit. : 
A neutron has a somewhat greater mass than a proton, i.e., its 


mass is equal to 1.6737 x 10~** gm. It also has a ‘spin of. 5+ lis magnet- 


ic moment is anti- parallel to the spin and equal to 0. 966 x 10-*° Gaus- 
sian unit. 

A neutron does not have an lasted charge and does not leave a 
track in a Wilson cloud chamber or on a photoplate. The properties 
of a neutron are primarily determined by studying collisions between: 
neutrons and various nuclei. Knowing the mass and velocity of a nu- 
cleus which collides with a neutron, one can determine the velocity 
Uo. and the mass M,,,,. Thus, in accordance with the laws of elas- 
tic impact (see pp. 64-65), we obtain _ 


v = 2M nent zu 
Gare M neut ae M nicl ia 
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where M,eu¢ and v,,,,4 are unknown quantities. By studying collisions 
between neutrons and various nuclei, onecan roughly determine M,,on¢ 
if it is assumed that the initial velocity v,,,, is the same in different 
collisions. A precise value for M,,,,,; may be determined from the values 
of the mass defect of nuclear reactions (see below). 

The spin and magnetic moment of neutrons have been directly 
determined from very interesting measurements on a stream of neu- 
trons passing through magnetised iron. However, we shall not de- 
scribe these measurements. 


206. Mass and Energy of an Atomic Nucleus 


The number of protons Z in the nucleus of a given element deter- 
mines thechemical properties of the element, and the position of an 
atom in the Mendeleyev periodic table is determined precisely by the 
number of its protons Z. 

A chemical element may have several isotopes, which differ from 
one another with respect to the number of neutrons in the nucleus. An 
isotope of a given element may be described by its mass number M, 
the total number of protons and neutrons in its nucleus *. Thus, 
the number of neutrons in a nucleus is equal to M — Z. 

Ordinary natural substances are a mixture of isotopes. The isotop- 
ic composition of a natural substance generally remains fixed and 
is, therefore, characteristic of the given chemical element. Frequently, 
one of the isotopes of the mixture predominates. For example, hy- 
drogen is encountered innaturein the form of ordinary hydrogen H!' 
and deuterium H’=D, whereby the percentage of the former is 
99.98% and that of the latter 0.02%. The percentage of O'* in natu- 
ral oxygen is 99.76%. In natural uranium the main isotope is U?**; 
its percentage is 99.28%. 

Let us represent the mass of the lightest and most prevalent isotope 


of oxygen by M,. The magnitude eM, is called an atomic weight unit. 


It is customary to express the atomic weight A of isotopes and ele- 
ments in such relative units. Precise measurements indicate that an 
atomic weight unit is equal to a mass of 1.662 10-** gm. The abso- 
lute value of the mass of an isotope may be determined from the 
formula. 

M,= 1.662 x 10-** A gm. 


* The nucleus of an isotope of a given chemical element is denoted by the 
symbol of the element. The mass number is indicated by a superscript to the 
right. A subscript to the left is frequently used to indicate the atomic number 
Z of the element, but this is not necessary since the chemical symbol deter- 
mines Z. Thus, the three isotopes of oxygen may be denoted by the symbols O', 
O” and O' or 0", ,O" and ,O"*. The nuclei of these isotopes contain 8, 9 and 10 
neutrons (M—Z), respectively. 
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The mass of a proton is 1,836 times greater than the mass of an elec- 
tron. Therefore, the mass of an atom and the mass of its nucleus 
are almost equal. However, in a number of cases, particulagly for 
light atoms, this difference may be determined by means of modern 
measuring methods and should be taken into account. It is evident 
that the following relationship exists between the mass of an atom 
M, and the mass of its nucleus My. 


In atomic weight units, m=5.5 x 107**- Thus, the difference between 
My and M , is of the order of several hundredths of a per cent of these 
masses (for heavy atoms—of the order of several thousandths of a 
per cent). . 

The relative atomic weight of an isotope is approximately equal 
to its mass number. For example, the mass of H’ is equal to 1.00812, 
of D* to 2.01472, of Ne*® to 19.9981, etc. 

The following important conclusion may be drawn from a careful 
study of a table of isotope masses: the mass of a nucleus is less than 
the sum of the masses of its constituent particles. For example, the 
mass of a neutron is -1.00893 and the mass of a proton is 1.00812; 
hence the sum of the masses of two neutrons and two protons is equal 
to 4.0341; but the mass of a helium atom, consisting of two neutrons 
and two protons, equals 4.0039. Thus, the mass of a helium nucleus 
is 0.0302 of an atomic weight unit less than the sum of the masses of 
the constituent particles of the nucleus. This value is a thousand 
times larger than the accuracy of the measurements. 

This difference between the sum of the masses of the constituent 
particles of the nucleus and the mass of the nucleus is an important 
example of mass defect. Every nucleus has a specific mass defect. 

One of the most important conclusions of the theory of relativity 
is the principle of the equivalence of mass and energy (see p. 422). 
This principle states that if a system acquires or loses a quantity of 
energy A@, the mass of the system increases or decreases, respec- 
tively, by Am=A@/c*. The mass defect of anucleus is easily explained 
by means of this principle, i.e., it is ameasure of the binding energy 
of the nuclear particles. 

Let us see what this means. In chemistry and physics, binding ener- 
gy is understood to mean the energy required to completely break a 
bond. If a nucleus could be divided into its constituent particles, 
the mass of the system would increase by the value of the mass de- 
fect Am. From the viewpoint of Einstein’s law, this means that the 
nucleus has been given an energy A@ =c’Am, which is equal to the 


. * This value is obtained by dividing the mass of anelectron in grams by 
1.662 10~24 gm. 
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binding energy. Hence, a change in mass of one atomic weight unit 
is equivalent to a change in energy of 


1.662 x 10-**x9x« 10° erg= 1.496 107° erg =931.8 Mev 


(l ev = 1.601x10-** erg and 1 Mev=10° ergs). Using these values 
and knowing the values of the mass defect, one can easily calculate 
the binding energy of an atomic nucleus. 

Fig. 239 shows acurve of binding energy per nuclear particle, i.e., 
c’?A m/M, as a function of mass number. It is seen that the binding 
energy per nuclear particle first rises rapidly, although not quite uni- 
formly, then remains at approximately 8 Mev, and finally drops 
slightly for the last elements in the Mendeleyev periodic table. The 
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Fig. 239 


following conclusion may be drawn from the fact that the energy 
remains constant at 8 Mev over a large portion of the curve: Since the 
binding energy per particle does not depend on the total number of 
particles in a nucleus, interaction in a nucleus must occur only be- 
tween particles that are very close to each other. It follows that nu- 
clear forces between particles become effective only when one parti- 
cle approaches another very closely (see below). 

It is instructive to compare the 8 Mev value with the chemical bind- 
ing energies of molecules. The latter are usually equal to several 
electron volts per atom. Thus, the energy required to break up a nu- 
cleus is several million times greater than the energy required to 
break up a molecule into atoms. 

Nuclear forces will be discussed in greater detail below. It is al- 
ready clear from the above examples, however, that these forces reach 
tremendous values when a nucleus breaks up. It is also clear that 
nuclear forces constitute a new kind of force, since they can bind 
together particles the electric charges of which are of the same sign. 
Nuclear forces are not reducible to electrical forces. 
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207. Radioactivity 


Substances in which atomic nuclei are continuously disintegrating 
are said to be radioactive. Two kinds of radioactive disintegration 
exist: a-decay and B-decay. 

Alpha decay consists in the ejection of an a-particle from an atom- 
ic nucleus. Since an a-particle consists of two protons and two neu- 
oe its symbol is ,He*. Thus, a-decay may be represented as fol- 
OWS: 

sb =, ci Me, 


where El is an arbitrary chemical element. This disintegration reac- 
tion, as well as other nuclear reactions to be considered on p.567, obeys 
the law of conservation of charge (the sum of the subscripts in the 
right member of the equation equals Z) and the law of conservation 
of mass number (the sum of the superscripts in the right member of 
the equation equals M). 

Beta decay consists in the ejection of an ordinary electron (B~) or 
a positron (B*) from a nucleus. It should be recalled that the masses 
as well as the magnitudes of the charges of these two particles are 
equal. The ejection of such a light electrically charged particle 
from a nucleus results in the nuclear transformation of a proton into 
a neutron or a neutron into a proton. Such a transformation ensures 
the conservation of electric charge upon disintegration. 

Beta decay may be represented as follows: 

zEIM = ,, ,EIM+ B43 FY kee oie wa ere 
Thus, for B~-decay, aneutron of the nucleus is transformed into a pro- 
ton (the number of protons increases); for B*-decay, the reverse trans- 
formation takes place. 

In addition, y-rays, i.e., electromagnetic radiation of shorter 
wavelength than X-rays, may be emitted in both types of radio- 
active disintegration. Alpha radioactivity occurs only for the heavy 
elements (beginning with bismuth); B~-radioactivity is encoun- 
tered considerably more often than 6*-radioactivity. 

A radioactive substance found in nature is said to be naturally 
radioactive; a radioactive substance obtained by means of nuclear 
reactions is said to be artificially radioactive. 

If another element is formed when the nucleus of a radioactive ele- 
ment decays, and if a third element is formed from the second, etc., 
the sequence of such elements is called a radioactive series. Four radio- 


active series, beginning with U***, Th?**, U*** and U***, are known. 


* U3 has predecessors among the transuranic elements. 
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Radioactive decay obeys the law 
N=Ne~™ 


where N, is the number of nuclei present at the initial instant ¢—0, 
N the number of nuclei remaining (not disintegrated) after the elapse 
of atime ¢, and A a radioactive-decay constant, which is a constant 
for a given element. 

It is easily seen that the time interval T during shiich half of the 
number of atoms present decay : given by 


0.693 
T =I1n a aa 

The time interval T is Caowi as the half-life period or simply 
half-life of the radioactive element. 

The predecessors of the naturally radioactive series have a half- 
life lying in the range of 10°-10'° years. On the other hand, the half- 
life of intermediate decay products and artificially radioactive ele- 
ments may be equal to an extremely small fraction of a second. 

A quantity of radioactive matter could be expressed, of course, in 
grams. However, it is easier and more convenient to describe a quan- 
tity of radioactive matter by its activity, i.e., the number of decay 
events per second. The curie is a historical unit of measurement 
which is equal to 3.7 x 10"° decays per second. In laboratory work, this 
unit is found to be inconveniently large; hence, the millicurie, one- 
thousandth of a curie, is often used. Another unit used is the ruther- 
ford, which is equal to 10° decays per second. Thus, | millicurie= 
97 rutherfords. 

If the half-life T of a substance is known, its initial radioactivity 
can be easily determined. The fraction of substance decaying in | 
second is equal to 


N ao ae 
1 WN, = | —e 
or, since A is a small number, we obtain 


0.693 
1-7 aha. 


In the case of m grams of substance of atomic weight A, the activity 


is equal to 
0.693 m decays” 


T * Ax1.66X107 ~ sec 


It is generally assumed that the radioactive products found during 109 
days of operation of a nuclear reactor (see below) amount to | curie per watt. 
For 500,000 kw, the quantity of decay products is equal to about 500 gm, 
i.e., 107° gm per watt. Taking the average atomic weight of the fission products 
as 100, we find by means of the above formula that the average half-lite of the 
radioactive products is equal to 10* seconds, i.e., about 24 hours. 
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We cannot describe fully the properties of a radioactive substance 
by means of its half-life and activity alone. We must indicate, in ad- 
dition, whether the substance is an a-particle or f-particle radia- 
tor and whether the decay is accompanied by y-radiation. An even 
fuller description requires that we specify the energy of the par- 
ticles ejected from the nuclei and the energy of the radiation. The 
properties of a-particles radiated from one radioactive material dif- 
fer very little from the properties of 
a-particles radiated from another ra- U 
dioactive material. The initial veloc- 
ities of such a-particles lie in the 
range of 15,000-20,000 km/sec and 
the number of pairs of ions formed in 
air by such an a-particle lies in the 
range of 1x10°-2x10°. The ener- 
gies of B-particles ejected during decay 
are distributed continuously from zero 
to several hundred or thousand kev. 
The energies of y-rays emitted by one 
radioactive substance differ from the 
energies of y-rays emitted by another 


radioactive substance, but their order Fig. 240 
of magnitude remains the same for all 
elements. 


Now, let us direct our attention to the theory of a- and B-decay. 
In alpha decay, an a-particle tunnels through a potential barrier and 
is then subjected to electrostatic repulsion. The potential curve of 
a nucleus is illustrated in Fig. 240. We see a potential well and a po- 
tential barrier (beyond the barrier the electrostatic potential energy 
drops hyperbolically). It has been shown in the case of a radioactive 
element by scattering a-particles from nuclei of this element that the 
height of the potential barrier is at least 9 Mev. Nevertheless, parti- 
cles having an energy of only 4 Mev escape from nuclei by tunnel- 
ing through the barrier. 

Such a model explains why one radioactive element has a very 
short half-life and another has a very long half-life. If the difference 
between the energy of an a-particle in a nucleus and the height of 
the potential barrier changes even slightly, the probability ok 
a-particle leakage through the barrier changes radically (see the 
formula on p. 483). 

Now, let us consider B-decay. The existence of the neutrino, a parti- 
cle which eluded detection for a long time, was first hypothesised in 
explaining this process. 

Two arguments immediately present themselves in support of the 
existence of the neutrino. First, such a particle is required to satisfy | 
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the law of conservation of angular momentum and, secondly, it is 
required to satisfy the law of conservation of energy. We know that 
neutrons, protons, electrons and positrons have a spin of %. As has 
been indicated, B-decay in an atomic nucleus transforms a proton 
into a neutron, or vice versa. Since the number of nucleons in a nu- 
cleus remains unchanged, f-decay cannot transform an even spin into 
an odd one. But this is precisely what would be required if only an 
electron, having a spin of %, were ejected during B-decay. This con- 
tradiction was resolved by hypothesising the existence of the neutri- 
no, a particle having a spin of %. In addition, by means of the neu- 
trino, we can explain why the f-particle spectrum is continuous. 

If 6-decay consisted in the ejection of only an electron, this elec- 
tron would have a well-defined energy since the initial and final 
energy levels, i.e., the energies of the primary nucleus and the new 
nucleus, are well defined. But as has been indicated, a continuous 
spectrum of electrons, from a maximum velocity down to zero, is 
obtained. Such a spectrum can be explained by assuming that during 
disintegration two particles are ejected from a nucleus in accordance 
with the equation n=p-+e-+v. The energy is divided between the 
electron (or positron) and the neutrino in a random manner. 

The detection of a neutral particle having a negligibly small mass 
(we now know that the mass of a neutrino is less than 0.002 of the 
mass of an electron) is an extremely difficult problem. This problem 
was not solved until 1956. 

We have indicated that a neutrino is formed when a neutron disin- 
tegrates. A particularly large number of such decay events should 
occur in a nuclear reactor, where a tremendous number of nuclear 
fragments, rich in neutrons, are continuously formed. If neutrinos 
exist, a stream of such particles should emerge from a reactor. 
When a neutrino collides with a proton the following reaction should 
occur: v-+pt—et*-tn, i.e., a positron and a neutron wil! be formed. 
Such reactions should be observed in targets containing large num- 
bers of hydrogen atoms (i.e., protons) if they are placed close to a 
nuclear reactor. This reaction should occur very rarely (several times 
per hour) since a neutrino has extremely high penetrating power. 
At the same time, a large number of other nuclear reactions occur 
close to a reactor. 

The difficulties involved in detecting a neutrino are evident. They 
may be overcome by properly utilising the distinguishing features of 
this reaction. We know that the positron is quickly annihilated with 
an electron of one of the target atoms and that such annihilation yields 
two photons. The neutron, after covering a certain distance in the 
target, is absorbed by one of the impurity atoms (cadmium) added to 
the target for this purpose. The average life-time of a neutron before 
being absorbed has been calculated. It is equal to approximately 
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5p sec. The absorption of a neutron by cadmium is accompanied by 
y-radiation. Using modern measuring techniques, the experimenter 
must distinguish the following sequence of events from all others: 
the creation of two photons, followed in 5 p sec by a stronger pulse 
of y-radiation. Since this has been achieved, the existence of theneu- 
trino may be considered to be a fact. 

Alpha and beta decay obey the following formula of decay as a 
function of time: N=WN,e~’!. Thus, the decay of a nucleus is an inde- 
pendent event that does not affect the behaviour of other nuclei. All 
nuclei have the same decay probability. Assume that half of the 
nuclei disintegrate during an interval of time T. Since the remain- 
ing half is subject to the same conditions as the original group of 
atoms, half of the remaining half disintegrate during an equal inter- 
val of time. The fact that the decay of a nucleus is not dependent on 
the behaviour of its neighbours means that in a given time interval 
At the fraction of the number of atoms present which disintegrate, 


i.e., aN, will always be the same. This statement may be expressed 
as Jollows: 


ay Ad. 

By integrating this expression, we obtain the exponential law of de- 
cay 

ii is useful to remember that the reason why exponential laws are 
encountered so frequently in physics is that they are the mathemati- 
cal expression for decrease in accordance with the widespread rule 
that for equal changes in argurnent a function decreases by the same 
fraction of its magnitude. 


208. Spin and Magnetic Moment of a Nucleus 


Nucleons, the components of a nucleus, have a spin and hence a 
magnetic moment. Thus, the presence of spin is not peculiar to an 
electron alone. Elementary particles may have spin, and a visual 
interpretation of this fact is not only unnecessary, but incorrect. 
We have indicated already that the model of a particle rotating about 
its axis is completely without basis since the spin of a particle 
cannot be interpreted classically. 

The angular momentum of any particle and, hence of an atomic nu- 
cleus, is given by 


Sarah 
Vs(6+1)xz, 


and the projection of the spin on the selected direction line may 
assume 2s-+1 values in the interval s to —s. Usually, it is not the above 
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expression which is called spin, but the number s determining this 
expression. 

In accordance with the laws of quantum mechanics, the difference 
2s between the largest and smallest values of spin must equal a whole 
number or Zero. Therefore, the spin of a particle may equal 
oe eee 
Neutrons and protons, just like electrons, have a spin oft. 

Examining tables of spin values for various atomic nuclei, we ob- 
serve a number of interesting regularities. Thus, nuclei consisting 
of an even number of protons and an even number of neutrons (He, 
C’*, O'*) have zero spin. Evidently, the number of nucleons equal to 
a multiple of four generally plays an important role. In many cases, 
but by no means in all, the spin of an atomic nucleus may be deter- 
mined in the following manner: the number closest to M which yields 
a whole number when divided by four is subtracted from M and the 


result is multiplied b 4. For example: Li* has a spin equal to 
Y9 Pp q 


—_ er: 1 3 
25=1, Li? ~> , BY —1 and B* —-F: 


There are no exceptions to the following, rather obvious, rule: 
the spin of anucleus for which M is even is equal to a whole number or 
zero, and the spin of a nucleus for which M is odd is equal to an odd 
multiple of '%. 

The spin of an atomic nucleus may be determined from the hyper- 
fine structure of its optical spectrum. Even though the energy levels 
split to a very small extent, the differences in levels may be meas- 
ured very accurately. Splitting occurs as a result of the fact that dif- 
ferent mutual orientations of electron spin and nuclear spin correspond 
to different energies. 

Available data on nuclear spins indicate that the Pauli exclusion 
principle is applicable to the protons and neutrons of nuclei. Two 
identical particles can be located at a single energy level only if 
their spins are anti-parallel. Since a proton differs from a neutron 
two protons and two neutrons may be located at a single level. 
This compact group of four particles, having a total spin of zero, 
will be recognised as the nucleus of a helium atom (a-particle). 

If a particle has spin, it also has a magnetic moment. The angular 
momentum L is directly proportional to the magnetic moment M, 
but the magnetic moment may be either parallel or anti-parallel to 
the spin. 

If the spin of a particle (simple or complex) is given by s, the par- 
ticle’s magnetic moment may be written in the form 


M = gps, 
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where p is an elementary magneton equal tore, mis themass of the 


particle, and gis a dimensionless factor. This equation is the gener- 
al form of the relation given on p. 496 for the case of an electron. For 


such a particle, s =i and g must be taken equal to2 in order to ob- 


tain agreement with available experimental data. 

Different particles (elementary as well as complex) have different 
values of g. For example, the g-factor of a neutron is equal to 3.8206 
and that of a proton is equal to 5.5791. 

The value of an elementary magneton depends on the mass of the 
particle. However, it is customary to use only two magneton values: 
the Bohr magneton for light particles and the nuclear magneton (the 
magneton calculated for the case of a proton) for heavy particles, 


whereby pn = gg Hone The values of the g-factor given above are 


calculated for py. 
A theory of g-factors and magnetic moments which relates these 
properties of a nucleus to its structure does not exist. 


209. Magnetic Resonance 


Assume that a substance containing particles of spin s and magnet- 
ic moment M is placed in a constant magnetic field of intensity H. 
The potential energy of such a particle is equal to the scalar product 
MN=M,,H. According to the general law of quantum mechanics, this 
energy may assume only a discrete set of values corresponding to 
2s-+1 possible orientations of spin and magnetic moment. 

As usual, the resulting system of energy levels is determined by 
the energy transitions. The selection rules allow transitions between 
neighbouring levels only if the difference between their s values is 
equal to one. Assume, for example, that in one state 


M,= gus 
and in another 
M, = ge (s— ty, 
Hence, the difference in energy is 
6,—6, = guH. 


The energy levels will be equally spaced. 
The frequency of a radiated or absorbed quantum of energy corrc- 
sponding to the calculated difference in levels is 


— 8b 
vas HA, 
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For an electron 
v= 2.8 10°H 
and for a proton 
v=3.46x 10°H. 


It is seen that for each valueof H there is a corresponding charac- 
teristic frequency known as the magnetic resonance frequency. For 
the realisable range of field intensities, these frequencies lie in the 
radio band: in the case of nuclei, in the short and ultrashort wave- 
length region; in the case of electrons, in the centimetre wavelength 
region. 

Experiments and theoretical calculations show that in practice 
it is not possible to detect the radiation corresponding to these 
frequencies. However, it is possible to observe resonance absorp- 
tion of electromagnetic waves of corresponding wavelength. For this 
purpose, the substance is placed in a coil connected to a high-frequen- 
cy generator and the coil is then placed in a constant magnetic field. 
The resonance may be “trapped” by varying the field intensity while 
the frequency is kept constant or by varying the frequency while 
H is kept constant. Magnetic resonance is extremely selective. The 
width of the absorption peak is of the order of 0.1 mc/s at 460 me/s. 

The magnetic resonance method is widely used in studying various 
substances. Observation of electronic as well as nuclear resonance is 
of great interest. The presence of electrons having spins which are 
uncompensated for indicates to the chemist that so-called free radi- 
cals are present and enables him to determine the nature of chemical 
bonds. The chemical composition of a substance may be determined 
by means of nuclear resonance. But the following important fact 
should be noted. The magnetic resonance effect is so sensitive as to 
enable the detection of the superposition of an atom’s electronic 
cloud field on the external field. It turns out that the nature of this 
supplementary field depends on the properties of the chemical bond 
between a given atom and the others. Thus, the resonance frequen- 
cies of a given atom vary somewhat depending on the chemical bond. 
This phenomenon is known as chemical displacement. 

In Fig. 241, we see an oscillogram of the absorption spectrum of 
a chemical compound. This illustrates magnetic resonance for fluor- 
ine nuclei. We see four peaks, one of which is three times as high as 
the other three. In the molecule whose structural formula is shown in 
the figure, four “different” fluorines are to be seen. There are three 
times as many fluorine atoms in the CF, group as each of the other 
three “chemically different” fluorine atoms in the molecule. Chemi- 
cal displacement has separated the nuclear resonances of the fluor- 
ine atoms and made it possible to determine the structural formula 
of this compound. 
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Thus, nuclear resonance provides us with a new means of chemical 
analysis. Instead of obtaining only a gross chemical formula (in our 
example, the total number of fluorine atoms relative to, say, hydro- 


F 
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F@ 
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gen atoms), we can now obtain a detailed picture of a chemical for- 
ae i.e., the proportions of differently bound atoms of a single 
ind. 





Absorption 


Magnetic field intensity 


Fig. 241 


210. Quadrupole Resonance 


The scheme of molecular energy levels discussed in the preceding 
chapter lacks certain detail. It turns out that each rotational level 
has an infrastructure. Between the electronic cloud of a molecule 
and the atomic nuclei there may exist yet another interaction: an 
atomic nucleus may possess an electric quadrupole moment and de- 
pending on the orientation of the atomic nucleus relative to the elec- 
tronic cloud the molecule may possess different values of energy. The 
magnitudes of the energy associated with this interaction are quite 
small and the corresponding energy levels are grouped around the rota- 
tional levels. 

Thus, to describe a molecule, one must indicate its quadrupole 
energy level in addition to its electronic, vibrational and rotational 
levels. 

Quadrupole interaction does not always exist. If it does, the rota- 
tional transitions discussed above are in fact rotational-quadrupole 
transitions. Pure quadrupole transitions, i.e., transitions between 
separate quadrupole levels, may be observed. Moreover, rotational- 
quadrupole transitions may be resolved. Both problems can be 
solved by radio-spectroscopic methods. Pure quadrupole transitions 
lie in the range of 1-800 mc/s, i.e., in the short-wavelength band. 
Rotational-quadrupole transitions may be detected by studying the 
absorption of microwaves (millimetre waves) in gases. 

Pure quadrupole transitions are of primary interest. They can be 
observed in solids and in certain liquids. 

The following formula gives the energy of interaction between an 
atomic nucleus and the electronic c!oud of a molecule for the case of 
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an axially symmetrical field (such a field exists in all linear mole- 
cules): 


a 3m?—s(s+1), 
6 = 009 asi) 
here, Q is the quadrupole moment of the nucleus and qg=5 


is the second derivative of the electric potential along the axis of sym- 
metry of the field. This effect does not occur in the case of nuclei 
having a spin of 0 or %. Also, no interaction occurs when the elec- 
tronic cloud surrounding a nucleus is spherically symmetrical. 
There are a limited number of levels. If the selection rules are ta- 
ken into account, one finds that the number of possible transitions 


is quite small. Thus, one line occurs when s=1 and s=5, two lines 


when s= 2, and three lines when s =f. 

A quadrupole absorption spectrum can be obtained using a genera- 
tor the frequency of which is continuously varied in the wavelength 
interval under investigation. The resolving power of radiospectro- 
scopic methods is very high. In the case of a spectral line the frequen- 
cy of which is of the order of 30 mc/s, the line width is equal to sever- 
al hundred cycles per second. 

The electric quadrupole moment Q is a constant of an atomic nu- 
cleus. It describes the deviation from spherical symmetry of the dis- 
tribution of electric charge in a nucleus. The quantity Q (in square 


centimetres) can be determined from the above formula if q= aad 


is known and the quadrupole frequencies can be measured. The 
deviation from spherical symmetry of the distribution of electric 
charge in a nucleus can be determined to a first approximation by rep- 
resenting the nucleus as an ellipsoid of revolution. If the nucleus 
is elongated in the spin direction, then Q>0; if it is elongated perpen- 
dicular to the spin, then Q<0. 

An ellipsoidal nucleus tends to become oriented in a very definite 
manner in the field of an electronic cloud. The main energy level 
corresponds to an arrangement in which the axis of symmetry of the 
field and the axis of the ellipsoid coincide. In excited states, in view 
of the discreteness of energy, the axis of the ellipsoid can assume only 
several selected orientations relative to the axis of symmetry of the 
field. The energies of these quantum states is given by the above 
formula. An electromagnetic wave impinging on a molecule is ab- 
sorbed if the magnitude of the photon corresponds to the energy of 
transition from one orientation of the ellipsoid to another. 

The study of quadrupole spectra began quite recently. Such spectra 
are of great scientific interest as can be gauged by the fact that they 
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enable us to measure frequencies with extremely high accuracy and 
that the quadrupole frequency will react to very small changes in 
the electric field of the molecule of which the nucleus is a component 
as well as of neighbouring molecules. 

Suffice it to say that quadrupole frequencies will differ percepti- 
bly in crystalline varieties of one and the same substance. Thus, a 
nucleus reacts not only to changes in the field due to close electrons, 
but also to changes in the field due to more distant electrons. 


Example. The electric quadrupole moment of a nucleus of Cl*§ is Q=— 0.07X 
x 10~*4cm?, Quadrupole resonance occurs in Cl, when v=54.47 mc/s. For a 
nucleus of Ci**, the spin s is equal to +. This means that the quantum number 


3 1 1 
m assumes the values —,—>, -> 


ar . and —2 . Since quadrupole interaction energy 
is a function of m?, when anelectromagnetic energy quantum hv is absorbed 
only one transition is possible, i.e., a transition from the level corresponding to 
|m [= 4 to the level corresponding to |mja. 
The resonance condition is 


eG eG) 9) RGN) ag 


— ee  , 


3 3 3 3 a 
tx (2xF-1) ixg (exZ-1) 


By measuring the resonance frequency v and knowing from other data the value 
of the quadrupole moment Q of a Cl** nucleus, one can determine the gradient 
of electric field due to electrons at the centre of a Cl®* nucleus in a Cl, mole- 
cule: 


=2.14x10° Gaussian units. 





JOE] 2hv 2X 6.6 X 10-27% 54.5 X 10" 
: cs = eQ 4.8 XK 10-1 0.07 X 107% 


211. Nucleon Interaction Forces 


Our basic knowledge of nuclear forces can be obtained by studying 
the scattering of particles. It was concluded from Rutherford’s first 
experiments in the scattering of @-particles that nuclear forces have 
a very small range. Rutherford’s results could be explained quantita- 
tively by assuming that the deflection of a-particles is due to electric 
repulsion between charged particles having the same sign. The ex- 
perimental results agreed with the results of theoretical calculations 
even when an a-particle passed extremely closely to the scattering 
nucleus. This means that it suffices to separate two nuclear partigles 
by a very small distance in order for the effective forces to consist 
only of electric forces; the nuclear forces will then no longer be efiec- 
tive. : 

More direct evidence can be obtained from the scattering of neu- 
trons by protons. For this purpose, a neutron beam is passed through 
hydrogen in a gaseous state. Experiments indicate that only a small 
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fraction of the neutrons collide with nuclei of hydrogen atoms. The 
angular distribution of scattered neutrons is uniform. This result 
differs basically from that of a-particle scattering, i.e., scattering 
due to electric interactions. In such scattering, deflection always 
occurs, but the deflection is small when an a-particle passes far from 
a nucleus and large when such a particle passes close to a nucleus. 
It can be concluded from the patterns obtained in the scattering of 
neutrons by protons that the effective range of nuclear forces is very 
small. A value of the order of 2x 107** cm is reliably deduced from 
such experiments. 

This is also the value obtained for the effective range from the scat- 
tering of protons by protons. In this case, the experiments and 
calculations are rather intricate since it is necessary to “deduct” that 
portion of the scattering due to purely electric interaction. However, 
the deduction required can be determined using data from observa- 
tions at high energies and large angles. Unfortunately, direct experi- 
ments in the scattering of neutrons by neutrons are not possible, but 
considerable indirect evidence indicates that in this case too nuclear 
forces have the same properties. For example, let us compare the bind- 
ing energies of tritium (hydrogen isotope of mass 3) and the helium 
isotope of equal mass. In the first case a nucleus consists of two neu- 
trons and one proton and in the second of two protons and one neu- 
tron. It turns out that the binding energy of this helium isotope is 
less than that of tritium by an amount exactly equal to the electric 
interaction between two protons. ; 

From these experiments one can conclude that the nuclear forces 
acting between nucleons are independent of the electric charges of 
the interacting particles. 

From experiments in the scattering of nucleons one can also con- 
clude that the interaction is of an exchange nature. The term exchange 
is used to indicate that the colliding particles interchange properties, 
i.e., a proton is transformed into a neutron and vice versa. Experi- 
mentally, this has been shown to occur in the scattering by protons 
of a beam of neutrons having very high energies (scores of times 
greater than the potential energy of interaction between protons and 
neutrons). One would expect that in such an experiment most 
neutrons pass through hydrogen without scattering, but in fact the 
forward scattered beam consists of equal numbers of neutrons and 
protons. 

The problem of nuclear forces is complicated by the fact that such 
forces depend on nuclear spin orientations. These orientations cause 
nuclear forces to lose their central nature, i.e., the forces do not act 
along the lines joining the centres of particles. 

Many difficulties have not yet been overcome and the nature of 
nuclear forces still remains unexplained. It is quite probable, however, 
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that the meson theory of the Japanese physicist Yukawa will contin- 
ue to form the basis of the theory of nuclear forces. This theory 
will be discussed in Sec. 214. 


212. Nucleons in a Nucleus 


Nucleons are very closely spaced in a nucleus. The following for- 
mula for the “radius” of a nucleus is in reasonably close agreement 
with a number of experimental facts: 


R=ky/M. 


Here, M is the mass number and k=1.5x10~"*cm. The formula has 
been established for heavy nuclei, but it is undoubtedly valid for 
light nuclei as well. Since the radius is proportional to the cube root 
of the number of particles in a nucleus, the volume is proportional to 
the first power of this number. Therefore, it must be assumed that, at 
least to a rough approximation, the packing density of nucleons in 
a nucleus is uniform. 

Since nucleons are of a wave nature, one cannot, of course, con- 
struct a geometric model of an atomic nucleus and determine the path 
of nucleons in a nucleus. 

As an approximation, each nucleon can be pictured as moving in 
the field of all the others. Such a nucleon will have a system of energy 
levels which can be filled consecutively in going from light nuclei 
to heavier ones. Like in the case of electrons, the lowest level cannot 
have an angular momentum. In accordance with the Pauli exclusion 
principle, at this lowest level we can have two neutrons and two 
protons (the particles of each pair having opposite spins), i.e., 
an a-particle. An analysis of heavier nuclei shows that stable 
groups of particles will also be located at the other levels. The shell 
model of a nucleus has proved very useful in determining a number 
of nuclear properties and in explaining the prevalence of various 
isotopes. 

It will be noted in examining the composition of atomic nuclei that 
as we proceed from light particles to heavier ones the number of neu- 
trons in an atomic nucleus increases more rapidly than the number 
of protons. Thus, in the case of the very stable element lead, there are 
82 protons and 126 neutrons. This increase in the number of neutrons 
can be explained by the need to counterbalance the increased etec- 
tric repulsion of the protons. 

From another viewpoint, of course, the lack of equality between 
protons and neutrons is disadvantageous. When this equality exist- 
ed, the low energy levels could be filled with a maximum number 
of particles since, in accordance with the Pauli exclusion principle, 
{wo neutrons and two protons can exist in one state. However, if 
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this occurred, the electric repulsion would increase too much and the 
total energy would not be a minimum. Evidently, an actual case 
represents a compromise solution between these two tendencies. 
Beta decay phenomena, which occur so frequently in artificially ra- 
dioactive elements, represent the selection of the optimum situation 
in the indicated sense. If there are too many protons or too many neu- 
trons in a nucleus, this is corrected by the emission of an electron or 
a positron. 

Like in the case of an atom, a nucleus usually occupies the lowest 
energy level, but it can exist in an excited state. It has been known 
for a long time that radioactive decay is often accompanied by elec- 
tromagnetic radiation. The reason for this is that after emitting an 
a-particle a nucleus remains in an excited state and then passes to 
a low energy level by emitting a photon of very high frequency. 
The latter emission is known as y-radiation. A nucleus can also pass 
into an excited state by colliding with another particle. A study of 
the transition energies of a nucleus enables us to construct its system 
of energy levels. 


213. Interaction of Fast Electrons 


If electrons move slowly, the forces of interaction are determined 
by the arrangement of the charges at the instant of interaction. There- 
fore, in the case of slow electrons, the fact that an electromagnetic 
field exists is of no significance, i.e., it is unimportant that the 
interaction is transmitted by means of a field. 

The situation is different in the case of fast particles. Here, it is 
necessary to take into account the lag in interaction due to the finite 
velocity of propagation of an electromagnetic field. Interaction at 
the instant ¢ is determined by the arrangement of the charges at the 


instant t—— . Now, the field cannot be dispensed with. How can the 


quantum nature of a field be taken into account while maintaining 
the particle-field-particle interaction scheme? Such questions are 
dealt with in quantum electrodynamics, a field of theoretical phys- 
ics still being developed. Experimental evidence compels us to 
assume, in considering the interaction of fast particles that such 
interaction consists in the transfer of an energy quantum to the 
field by a particle and the subsequent transfer of this quantum. to 
a second particle. 

If the magnitude of the transferred quantum of energy is equal to 
& and the time required to transfer this quantum from particle to 
particle is equal to t, then according to the principle of uncertainty 
(see p. 477). 

Gt~h. 
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If the particles are close to each other, t is small and the particles 
can exchange quanta Av of low and high frequencies. As the distance 
between particles increases, t increases and energy exchange can oc- 
cur only by means of small quanta. Using such reasoning, one can de- 
rive a formula for the force of interaction between particles. 

Many authors describe the quantum interaction of particles through 
the medium of a field in the following picturesque terms: In transfer- 
ring a quantum of energy @ to a field, a particle “lends” this energy 
for a short period of time 7. The principle of uncertainty defines the 
dependence of the lending time on the amount of loaned energy. It is 
as if the law of conservation of energy were “violated” for the period 
of time T, i.e., the lending time. The principle of uncertainty indi- 
cates the permissible time interval during which the law of conser- 
vation of energy may be “violated” in order for the “violation” to be 
physically meaningless. 

This viewpoint may be developed further, but it leads to the fol- 
lowing difficulty. Since there is no limitation in the absorption and 
radiation of photons, infinitely large changes in the value of a parti- 
cle’s intrinsic energy occur when photons are exchanged. 

It is interesting that this occurs in all approaches to the evaluation 
of the intrinsic energy of an electron or other charged particle. It 
was seen on p. 243 that a point particle possesses infinite energy. At 
the same time, it is impermissible to assume that the particle has 
finite dimensions since a perfectly solid particle having finite dimen- 
sions cannot exist according to the theory of relativity. (The exist- 
ence of perfectly solid bodies is inconsistent with this theory since in- 
teraction would be propagated instantaneously through such bodies.) 
But if an electron is a deformable particle, what is its structure? A 
solution to this problem has not yet been found. 

A distinctive feature of the new theory is that, in spite of the fact 
that it is based on contradictions and lacks logical harmony, it leads 
to a number of new, very interesting results. 


214. Meson Theory of Nuclear Interaction 


In the preceding article, we stated that interaction between elec- 
trically charged particles occurs through the medium of an electromag- 
netic wave by means of quanta. An electrically charged particle 
transmits a quantum of energy to an electromagnetic field and this 
energy is then transmitted to another particle. If it is assumed that 
a field is associated with nuclear forces and that this field is also of a 
quantum nature, interaction between nucleons can be described as 
follows: Each nucleon is surrounded by a field; a nucleon transmits a 
quantum of energy to the field and the field transfers this energy 
to another nucleon. 
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A theoretical study to determine whether such an explanation of 
nuclear forces is permissible was undertaken by Yukawa. It turned 
out that a theory could be developed if it is assumed that the field, 
through the medium of which nucleons interact, possesses quanta hav- 
ing a rest mass that is not equal to zero. Interaction between nucle- 
ons was reduced in this manner to an exchange of particles having 
amass m=40. For reasons to be explained below, such particles became 
known as mesons. A meson is a quantum of the mesonic field which 
surrounds nucleons. 

Now, let us examine several conclusions to be drawn from the 
theory. First, we shall try to estimate the range of nuclear forces. 

The energy of a meson which is transferred by a nucleon during in- 
teraction cannot be less than m,c?, where m, is the rest mass of the 
meson. The order of iaenitude of the time taken to transfer a meson 


is not greater than —. aaa (on the basis of the relation 6t~A, where @ is 


the quantity of francterred energy and t is the time taken to transfer 
this quantum). Since the velocity of a meson is less than c, a meson 


cannot be transferred over a distance greater than a Thus, the 


vue of this constant quantity should give us the range of nuclear 
orces. 

Such is the conclusion drawn by Yukawa. Using the range of nucle- 
ar forces known at that time, Yukawa showed that theoretical and 
experimental results coincide when m,, the rest mass of a meson, is 
200-300 times greater than the mass of an electron. 

Positive, negative and neutral mesons have equal validity in 
this theory. Thus, all interactions between nucleons can be easily in- 
tegrated within the framework of one theory. Designating Yukawa’s 
meson by a and using a superscript to indicate the sign of the meson, 
one can express the interaction between two neutrons or two protons 
as an exchange process involving a neutral meson: 


etptmw; naetntn, 


An exchange interaction between a proton and a neutron isa 
process involving a positive or a negative meson: 


prtn-+nt; noepin. 


Yukawa’s theory was developed before the discovery of mesons. 
At present, the above interaction formulas no longer represent theo- 
retical predictions, but rather expressions for phenomena which have 
been actually observed. 


Sec. 215] Mesons 555 


215. Mesons 


The term “meson” (from the Greek word “mesos”, meaning average 
or intermediate) was coined to indicate that the mass of such a par- 
ticle lies between that of an electron and that of a proton. Experi- 
ments show that several kinds of mesons exist. Mesons (electrically 
charged) were first detected in cosmic rays. Now, mesons are created 
in accelerators—they arise when nucleons collide. 

However, not all mesons play the same role in interactions between 
nucleons. The Yukawa meson is a m-meson. As has been indicated, 
positive, negative and neutral a-mesons exist. Recent meas- 
urements indicate that the mass of am-meson is equal to 273 m,, 
where m, is the mass of an electron. 

The mesons which were first found in cosmic rays are p-mesons 
(positive and negative). The mass of a p-meson is equal to 207 m,. 
Ten years elapsed after the discovery of u-mesons beforeit was shown 
that such mesons are products of a-meson decay. The reason why 
investigators were able to detect p-mesons but were unable to detect 
st-mesons lies in the different lifetime of these particles. The ave- 
rage lifetime of a p-meson is about 107° sec, but that of as-meson is 
about a hundredth of this value, i.e., of the order of 10~* sec. 

Such transformations have been photographed numerous times. 
If we follow along a p-meson track on such a photograph, we may 
find that the velocity of the particle decreases (easily recognised by the 
change in the thickness of the particle track: the slower the particle 
the heavier the track since a slow particle creates more ions along its 
path than a fast one). At the point where the track ends, another 
track begins. Calculations indicate that the new particle is of very 
high energy, which may be explained only if we assume that a part 
of the rest energy of a a-meson has been transformed into kinetic 
energy. In addition, in order to satisfy the law of conservation of 
momentum, one must assume that a neutral particle of very small mass 
is created when a x-meson is transformed into a u-meson. Thus, the 
neutrino is again encountered. 

The rest mass of a-meson is of the order of 150 Mev. Therefore, a 
meson may be produced by nuclear bombardment with projectiles 
having an energy greater than this value. Actually, an energy great- 
er than 300 Mev is required. Nuclear bombardment with high-ener- 
gy particles results in the creation of a large number of mesons. 

Charged and neutral mesons have been created under laboratofy 
conditions. In the case of neutral mesons, this is particularly difficult 
since the lifetime of such a particle is 10~** sec. This means that the 
particle traverses a distance of only a thousandth of a millimetre 
during its lifetime. 
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216. Relativistic Theory of an Electron 


When the Schrédinger equation was discussed in Sec. 180, we did 
not use the relations of the theory of relativity since we assumed that 
ihe erent: of the particles were much less than the velocity of 

ight. 

We cannot dispense with the relativistic corrections when dealing 
with high-energy electrons, i.e., electrons having energies of the or- 
der of millions of electron-volts. Such high-energy electrons are en- 
countered in radioactive radiation, in X-ray tubes the operating volt- 
ages of which are equal to several million volts, in betatrons, etc. 

To correctly describe electrons having such velocities by a wave 
function, one must take into account the relationship between energy 
and momentum given by the theory of relativity. 

If the rest mass of a body is given, the energy and momentum of the 
body can be directly determined from the velocity of the body. There- 
fore, energy and momentum are related. Squaring the expressions 


& Myce Mv 


i cn Ce 


and then subtracting, we obtain 








2 
= p* + mie. 


This relationship can be written in the following form: 


G=+¢c V pt mic? 


(see Fig. 242). When p—0, = +m,c’. This means that the coordinates 
€ 


of the points at which the two branches 
of the curve intersect the axis represent 
equal rest energies of the particle. 

For a long time, no attention was paid 
to the negative branch of the curve. It 
seemed impossible that particles could 
exist which obeyed the laws of this 
branch. Indeed, a particle having nega- 
tive rest energy and the energy of which 
decreases as the momentum increases 
must behave very strangely. It is like a 
particle having negative mass, which 
Fig. 242 means that when a force is exerted on 
such an “electron” the particle is acceler- 





Sec. :216]. Relativistic Theory of an Electron _ 557 


ated in the opposite direction to that of the force. Thus, if we 
try to attract such a particle, it is repelled. Imagine that we 
have two close electrons—an ordinary one and an extraordinary 
one, i.e., one which obeys the laws of the lower curve. Elec- 
trons should repel each other. Therefore, the ordinary electron will 
tend to move away from the extraordinary one. But since the latter 
“electron” will be attracted under the action of the force of repulsion 
it will move toward the ordinary electron. As a result, this pair 
will be accelerated jointly, whereby the increase in positive kinetic 
energy of the ordinary electron will be counterbalanced by the in- 
crease in negative energy of the extraordinary particle. 

Does such a particle exist? First, it should be noted that from the 
viewpoint of quantum mechanics this is not another particle at all. 
The negative branch of the curve of energy as a function of momentum 
should be interpreted as giving the lowest energy level of one and the 
same particle, namely, an electron. An electron may be located at 
an ordinary energy level or at a negative energy level. Like in the 
other cases, transition to the lowest energy level should be accompa- 
nied by energy radiation. 

To be sure, in such a transition the radiation cannot be limited to 
a single photon. This may be explained as follows. When an electron 
passes from an @,, p, state of the upper curve to an@,, p, stateof the 
lower curve, the energy released is 


6,—6,=C (/ pi tmict + pit+mic'), 


i.e., more than c (p,+p,). If this energy were transferred to a single 
photon, then we should have €6,—&, = Av and, therefore, p,+ p, = seas 


c 
i.e., the change in the momentum of the photon (p, is anti-paral- 


lel to p,) would equal My . But 6, —@, is greater than ¢ (p,++p,), not 


equal to it. In the case of a single photon, it is not possible to satis- 
fy simultaneously the law of conservation of energy and the law of 
conservation of momentum, but this is always possible if two pho- 
tons are emitted. 

We should not be disturbed by thefact that the lower state corre- 
sponds to negative energy and that particles with @<0 behave very 
strangely. The difficulty lies elsewhere: Since there is limitless room 
at the lower level in view of the fact that it extends to all possible 
values of momentum, why do not all ordinary electrons pass over 
to this level? The proposed model accounts for the existence of only 
“extraordinary” electrons. To accord with this model, an ordinary 
electron should have a short lifetime, much like an excited atom. 


558 Atomic Nuclei (Chap. XXX 


217. Creation and Annihilation of Pairs of Particles 


A relativistic theory of the electron was developed by Dirac in 
1928. The description of an electron in this theory differs considerably 
from that in the Schrodinger theory. Four wave functions are used to 
describe the behaviour of an electron since one wave function no long- 
er suffices. The existence of electron spin does not follow from the 
Schrédinger theory, but electron spin is a necessary consequence of 
the Dirac theory. The success of this theory in describing numerous 
phenomena speaks for the validity of its basic concepts. Here, we 
shall consider in the light of this theory only the contradiction dis- 
cussed in the preceding article. How can we explain the fact that a 
tremendous number of ordinary electrons exist? How can we explain 
the “reluctance” of these electrons to pass over to the lower, nega- 
tive energy level? 

The Pauli exclusion principle does not allow an electron to pass 
over to a lower level if that level is occupied by two electrons having 
opposite spins. Let us apply this principle to our problem. The solu- 
tion will be obtained if we assume that all negative energy states are 
occupied. Does this mean that all space is completely filled with elec- 
trons having a negative energy state? This conclusion appears ines- 
capable. The Dirac theory has led to anew view of vacuum. In this the- 
ory, vacuum acquires physical properties, i.e., it is filled with elec- 
trons in a negative energy state. Moreover, it is filled boundlessly 
since the decrease in energy can extend to negative infinity.” 

Let us determine which phenomena can be explained and predict- 
ed by this theory. If all negative energy states are filled, their exist- 
ence may be detected only when an electron passes from a negative 
energy level to a positive energy level after receiving a significant 
portion of energy—in any case, not less than 2 m,c*. Such a process 
may involve the expenditure of two photons. Another possibility 
is the following: A photon passing close to a heavy atomic nucleus, 
which has a strong electric field, may give up its energy to raise an 
electron from a negative to a positive level. The role of the atomic 
nucleus consists in providing the necessary momentum. 

In both cases an electron is “created”. But in addition to the crea- 
tion of an electron, a “hole” is produced in the negative energy 
states. The removal of an electron, i.e., a negative charge, means that 
the hole acquires a positive charge of equal magnitude. On the other 
hand, the absence of a particle having negative energy signifies that 
the energy has increased. Therefore, the greater the momentum of a 
hole, the greater its energy. The inescapable conclusion is that “holes” 
behave like positively charged electrons having positive energy. Ex- 
cept for the sign of the charge, the behaviour and laws of motion of 
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a positive electron (positron) in no way differ from those of an ordi- 
nary electron. 

A positron and an electron are “created” as a-pair at the expense 
of the energy of photons. The reverse process, i.e., annihilation, is 
also possible. This consists inthe transformation of a colliding elec- 
tron and positron into two photons or, if annihilation occurs close to 
a heavy atomic nucleus, into a single photon. 

It is understandable why positrons have a short lifetime: they are 
attracted by electrons and disappear upon collision. But why do not 
ae disappear? The reason is simply that there is an excess of 

em. 

Do systems exist in which there are an excess of positrons, i.e., 
in which electrons are unstable? Such a supposition is not at all 
ridiculous. 

Pair creation and annihilation are processes which can be easily 
observed in large numbers under laboratory conditions. When 
y-rays having an energy greater than | Mev pass through a thin metal 
foil, the electrons which are ejected from the foil will be deflected in 
a magnetic field in opposite directions. By tracing the path of a posi- 
tron (means of observing charged particles are discussed in Sec. 204), 
one can determine the point at which the particle disappears. This 
is where the positron combined with an electron. Using modern pho- 
ton counter devices, one can show that two oppositely directed pho- 


tons, each having an energy of the order of — simultaneously emerge 
from this spot. 
218. Particles and Anti-Particles 


There is no reason for supposing that the existence of the positron— 
it would be better to call it an anti-electron—is due to a peculiar fea- 
ture of small particles. In spite of the distinctive features of the theo- 
ry of interaction between nucleons, it is basically similar to the theo- 
ry of interaction between electrons. In most theoretical studies, nu- 
cleons are considered to be described by equations which are quite 
similar to the Dirac equations for electrons. It was to be expected, 
therefore, that nucleons have anti-particles which stand in the same 
relationship to the proton and neutron as a positron does to an elec- 
tron. The first of such anti-nucleons to be discovered was the 
anti-proton. Somewhat later the anti-neutron, the magnetic moment 
orientation of which differs from that of the neutron, was discovered. 
(The magnetic moment and the angular momentum vector are anti- 
parallel in the case of the neutron and parallel in the case of the anti- 
neutron.) 

The discovery of the anti-proton proved that the concept of an 
inseparable link between field and particles is well founded. Like in 
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the case of a positron-electron pair, a proton-antiproton pair may be 
created by the passage of a particle (a nucleon) from a negative ener- 
gy state to a positive energy state. For this purpose, an energy of at 
least 2Mc’ is required. This is a tremendous amount of energy—1,840 
times the energy required to create an electron-positron pair. An 
accelerator which could accelerate particles to billions of electron- 
volts had to be constructed before it was possible to discover the anti- 
proton. 

If a proton and an anti-proton collide, they will be annihilated. 
Since nucleons transfer energy through the medium of a meson field, 
when they are annihilated their mass and energy are given up to 
quanta of this field, i.e., to mesons. This process will undoubtedly 
be subjected to careful study in the years ahead. 

Fig. 243 shows a photograph of the annihilation of a proton and an 
anti-proton. This process occurred in a bubble chamber filled with 
liquid propane. A sketch of the process is given in the upper left- 
hand corner. 

The reasoning used in explaining why anti-particles must exist 
extends to the anti-neutrino as well. An anti-neutrino is the “mirror” 
image of a neutrino. The difference between the particles composing 
this doublet is the same as in the case of the neutron-antineutron 
doublet. 

Mu mesons and other elementary particles which have not been 
discussed are also encountered as doublets. 

Pimesons constitute a triplet: there is a 1-meson having a positive 
charge, another having a negative charge and still another having 
zero charge. In contradistinction to the neutron and the neutrino, 
the neutral m-meson has no spin and, therefore, can have no anti-par- 
ticle (or expressed differently: the neutral m-meson coincides with its 
anti-particle). The photon is another particle which has no “mirror 
image”. 


219. Asymmetry of Elementary Particles 


Nucleon-nucleon interactions occur with the emission and absorp- 
tion of s-mesons. This is the strongest of the various interactions be- 
tween elementary particles and is responsible for the force binding 
nucleons in an atomic nucleus; its duration is 107** sec. Nuclear forces 
are about a hundred times as powerful as electromagnetic forces; 
the duration of electromagnetic interaction, which occurs by the ex- 
change of photons, is 10~** sec. These two types of interaction are 
known as strong interactions, in contradistinction to the weak inter- 
actions occurring in particle*transformations in which neutrinos take 
part. Examples of weak interactions are the transformation of a neu- 
tron into a proton, accompanied by the emission of a neutrino and 
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an electron (see f-decay), and the decay of m- and u-mesons: 
gm—>ptv and p-ety +4. 


The duration of weak interactions is of the order of 10~° sec. Nuclear 
forces are about 10'* times as powerful as weak interaction forces. 
In calculating the interaction force from the duration of the process, 
we are assuming, of course, that other conditions remain equal. 

Some extremely interesting discoveries have been made recently 
in connection with weak interactions. It has been found that weak 
processes exhibit “left-right” asymmetry. Thus, for example, in 
B-decay of cobalt nuclei polarised at low temperatures by means of 
a magnetic field (polarisation of the particles orients their magnetic 
moments and spins in a definite direction), the angular distribution 
of electrons is asymmetrical with respect to the “forward” and “back- 
ward” directions. Similarly, in p-meson decay, asymmetry with re- 
spect to the direction of motion of the particles has been detected. 

A theory for this phenomenon was proposed by Lee and Yang, and 
by the Soviet physicist Landau. Two explanations are possible: ei- 
ther the particles are internally asymmetrical or space is asymmetri- 
cal. We shall restrict ourselves to the first explanation. Its essence 
lies in the assumption that elementary particles are like screws as re- 
' gards their symmetry properties. Such asymmetrical particles are 
well known to the physicist. They include, for example, the left and 
right optical antipodes of molecules (see p. 409). It is clear that an 
asymmetrical elementary particle the axis of which is randomly 
oriented has different properties in the “forward” and “backward” 
directions. This has been proved experimentally. 

In order to explain the asymmetry observed in experiments with 
beams of y-mesons, one can use the Landau hypothesis, which relates 
the asymmetry of a particle to its charge. As indicated in the preced- 
ing article, all particles with the exception of the photon, are en- 
countered in nature as charged pairs. Landau proposed that if the 
symmetry properties of a particle are like those of a right-handed 
screw, the properties of its anti-particle are like those of a left-hand- 
ed screw. Reflection in a mirror makes a right hand appear like a 
left hand and a right-handed screw like a left-handed screw. Thus, 
the “mirror image” of a particle is its anti-particle. 

What application has this hypothesis to experiments with beams 
of -mesons? It can be proved that a particle having no mass must 
be oriented with its spin in the direction of motion. The mass of a 
neutrino is evidently equal to zero. Therefore, neutrinos are “longitu- 
dinally polarised”. The difference between a neutrino and an anti- 
neutrino can be reduced to the followinz: the spin orientation of a 
neutrino is in its direction of motion, while that of an anti-neutrino 
is in the opposite direction. Mu mesons are formed when x-mesons 
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decay. Since the spin of m-meson is equal to zero, the spin of a u-mes- 
on must be parallel to the spin of a neutrino. Therefore, the p-mesons 
formed in aa-meson beam are longitudinal. This explains the asym- 
metry observed in the distribution of electrons during the subsequent 
decay of these p-mesons. 

Investigation of the asymmetry of elementary particles may result 
in a significant revision of anumber of fundamental concepts in 
physics. 
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CHAPTER XXXI 
NUCLEAR TRANSFORMATIONS 


220. General Laws of Chemical and Nuclear Transformations 


Now, we shall discuss several energy relations which are equally 
applicable to chemical reactions and to transformations of atomic 
nuclei and other particles. 

Transformations can occur only when particles closely approach 
each other. Since particles must possess a certain kinetic energy in 
order for a transformation to occur we are quite justified in using the 
term “collision” to describe a close approach between particles. Not 
every encounter between particles results in a transformation. The 
mechanism of chemical and nuclear transformations is very diffi- 
cult to study. Since direct observations are impossible, one is forced 
to make hypotheses the validity of which can be checked indirectly. 

In the case of chemical transformations, the mutual orientation of 
molecules upon collision undoubtedly plays an important role. In 
order for a reaction to occur, molecules must approach each other in 
a manner suitable for the regrouping of 
atoms. 

For every transformation which can 
occur on a mass scale—the usual case 
in chemical and nuclear reactions where 
billions of molecules or nuclei collide 
during a short interval of time—one can 
indicate, in principle, the number A 
which, generally speaking, will be a meas- 
ure of the fraction of encounters in 
which particles are in a position “suita- 
ble” for a transformation to occur. 

However, the requirement of appro- 
priate orientation is, of course, not the 
only condition to be met in order for a 
transformation to occur. Since a particle is ordinarily stable and, 
hence, possesses a minimum of potential energy, an energy sufficient 
to lift the molecule out of its potential well must be imparted to 
it. This minimum necessary energy is called the activation energy. 
Fig. 244 shows a potential energy curve. The particle is stable 
when r=0, where r is a fixed parameter. In order for a reaction to 
occur, an activation energy @ must be provided. For the case repre- 
sented in the figure, the reaction proceeds with a liberation of heat. 





Fig. 244 
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We can apply the Boltzmann law to collisions between molecules, 
or between nuclei (under similar conditions), i.e., wecan assume 
that the number of encounters resulting in a transformation is pro- 
portional to e—*/*’, where @ is the activation energy. 

It is evident that therate of transformation may be expressed as the 
product Ae-#/*?, where the first factor takes into account the “geo- 
metric” conditions of encounter and the second the energy aspect. 

It is customary to give special consideration to the case of two col- 
liding particles at the instant when the potential energy is a maxi- 
mum. Such an activated complex (as it is referred to in chemistry) 
or compound nucleus (as it is referred to in the study cf nuclear trans- 
formations) does not exist for a very long period of time. The system 
may “slip back” into the poten- ,, 
tial well or “roll over” the well 
wall. In the latter case, a trans- 
formation has occurred and a 
new system with a new poten- 
tial energy has been formed. 

In chemical as well as nuclear 
transformations, the resulting 
system may consist of a single 
new particle (addition reaction) 
or two new particles. 

If the potential energy of the Mm#o-~ ~~ 
new particles is greater than the 
potential energy of the original 
particles (cf: the bottom of a 
volcano’s crater is below the lev- 
el of the foot of the mountain on which the volcano is located), the 
transformation proceeds with the absorption of energy. The absorbed 
energy (heat) will be equal to the difference between the activation 
energy and the energy of the reaction products (see Fig. 245). If 
the energy of the created particles is less than the energy of the 
original particles, heat is liberated. 

Both kinds of transformations are encountered in chemistry and in 
nuclear physics. Reactions proceeding with a liberation of heat are 
called exothermic, while those proceeding with an absorption of heat 
are called endothermic. 

Chemical and nuclear transformations are often accompanied by 
radiation. However, as a rule, the main energy effect of a reactior®con- 
sists in the transformation of the potential energy of the atoms in a 
molecule (or nucleons in a nucleus) into kinetic energy. Therefore, 
generally speaking, transformations in which heat is liberated are 
those in which two slow particles collide and produce two fast ones. 
Of course, in endothermic reactions, the opposite takes place. 





Fig. 245 
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It isseen from the formula that the rate of transformation increases 
exponentially with temperature. This is the reason why chemical 
transformations are extremely sensitive to changes in temperature. 
The higher the temperature, the greater the impact of colliding par- 
ticles. It is well known that temperature plays an important role in 
chemical transformations. In nuclear transformations, as a result of 
the tremendous values of binding energy, the role of temperature 
change is not so noticeable. The activation energy of atomic nuclei 
has an order of magnitude of several Mev, but if, for example, the 
temperature is increased to 3,000°, the energy of an atomic nucleus 
increases by only 0.4 ev. 

The temperature must be increased to millions of degrees rather 
than to thousands if we wish to accelerate nuclear transformations 
(see below). 


221. Nuclear Reactions 


A mass of experimental data has been accumulated on nuclear reac- 
tions. The number of such reactions which have been studied reaches 
several thousand. 

At present, the following types of nuclear reactions are known 
(in addition to radioactive decay, which may be viewed as a nuclear 
decomposition reaction): capture reactions, in which two colliding 
particles combine; exchange reactions, in which a particle is captured 
and another is ejected; and fission reactions, in which a nucleus 
breaks up as the result of energy received in one or another form. 
Nuclear reactions which occur under the action of hard y-rays are 
known as photonuclear reactions. 

By means of nuclear reactions, one can obtain stable natural iso- 
topes as well as unstable radioactive isotopes which are not encoun- 
tered in nature. It has proved possible to synthesise elements which 
have no stable isotopes (for example, technetium, an element having 
the atomic number 43), and also transuranic elements. 

The reactions which occur when various nuclei are bombarded with 
a-particles, protons and neutrons have been studied most carefully. 

When an a-particle collides with a nucleus, one of two types of 
reactions generally occurs: either the a-particle is captured and a pro- 
ton (p) is ejected or the a-particle is captured and a neutron (n) is 
ejected. These reactions are designated by the symbols (a, p) and 
(a, n), respectively. 

The equation for an (a, p) reaction has the form 


ZEIM + .a*—> 7, ,E1M*? +p’. 
The equation for an (a, n) reaction has the form 


gelM+ 2a — Z+ Pe ie + aii. 
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Here are some examples of a-particle reactions: 
(a, P) : _N"* ale 2a rag ei ay iP? 
pA + 2c" ae aot: + iP 3 
(a, n):,Be’+ ,a@*—> ,C-+ jn’; 
;B"° + a —_ ,N*® + on’. 





The first of the above (a, n) reactions is of great practical impor- 
tance since a mixture of radium (a-particle source) and beryllium 
is a common neutron source. 

A large class of reactions occur as the result of collisions with pro- 
tons. Such reactions include (p, a) reactions (in which a proton is 
captured and an a-particle is ejected), e. g., 


ce + ib" ants ,O°+ He’, 


and (p, n) reactions (in which the ejected particles are neutrons). 
Capture reactions which are not accompanied by the ejection of a 
particle also occur. The excess energy is released in the form of 
y-rays. Therefore, such reactions are designated as (p, y) reactions, 
eS. 
;Li’+,p’— ,Be’. 


Reactions involving deuterons (d), e.g., (d, p) and (d, n) reactions, 
have been subjected to careful study. When heavy water (deuterons) 
is bombarded with deuterons, a radioactive isotope of hydrogen, 
viz., tritium, may be formed: 


,D* — qr ai ee 1 iD’. 
However, a (d, n) reaction may also occur: 
,D?+ ,D*’— ,He* + ,n'. 


Reactions involving neutrons are of great importance in nuclear 
engineering since they occur abundantly in nuclear reactors. Such 
reactions include (n, a), (n, p), (n, 2n) and (n, y) reactions. In addi- 
tion, reactions involving the fission of heavy nuclei occur under 
the action of neutrons (see below). 

Two kinds of reactions occur between nitrogen and neutrons: 


(n, p):,N'*+- of + CT iD} 
(n, 2n):,N’*-++ ,n*'—+ N+ 2)n'. 

The first of these reactions yields a carbon isotope of long lifetime 
(more than 5,000 years). This isotope is of great importance in bio- 
chemical investigations. 

Almost all isotopes arecapableof capturing a neutron [(n, y) reac- 
tion]. In this manner, an isotope of the same element, the mass of 
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which is one unit greater than the original isotope, is formed. Usual- 
ly radioactive isotopes (B-radioactivity) are produced. 

Exothermic and endothermic reactions are possible in nuclear chem- 
istry, like in molecular chemistry. The magnitude and sign of the 
thermal effect can be determined using the principle of the equiva- 
lence of mass and energy. 

If nuclei of masses M, and M, are formed from nuclei of masses 
M, and M,, then 

M,+M,=M,+M,+ Am. 

When Am>0, i.e., when the nuclei of the reaction products have 
less mass than the original nuclei, the reaction is exothermic. When 
Am<0O, i.e., when the nuclei of the reaction products have greater 
mass than the original nuclei, the reaction is endothermic. The ener- 
gy released or absorbed during a reaction can be determined from the 
formula € =c*Am. Calculated and experimental results are in perfect 
agreement in all cases. 

In most nuclear reactions, the thermal effects are of the order of 
millions of electron-volts for each pair of reacting nuclei. This is 
millions of times greater than the corresponding values in chemical 
reactions. 


Sample calculation. Let us consider the reaction 
Be? 7 xp" = sLi* +a". 
The following are tabulated handbook values of the masses occurring in this 
reaction: mp.=9.01503, mp=1.00812, m,,;=6.91697 and m,=4.00390. The 
reaction is exothermic since Am=0.00228, i. e., Am>0. Since an atomic weight 


unit corresponds to 931.8 Mev, the thermal effect is equal to 0.00228 x931.8= 
=2.12 Mev or about 8X 107™ calories (1 Mev=3.827X 107 ** calories). 


222. Fission Reactions of Heavy Nuclei 


Atomic nuclei which are heavier than tin nuclei are capable of split- 
ting into two parts of approximately equal mass. In order for this 
fission to occur, a considerable amount of activation energy must be 
imparted to such a nucleus (the greater the isotopic mass number the 
smaller the activation energy). For nuclei of uranium, thorium and 
palladium, this energy is equal to about 5 Mev. The fission reactions 
of such nuclei are exothermic; the energy released considerably 
exceeds the activation energy. 

Fission may be caused by protons, deuterons, a-particles or y-ra- 
diation. However, fission caused by a neutron hitting a heavy nucleus 
is of paramount significance. 

Let us consider fission of uranium-235 under the action of neu- 
trons. When a neutron is captured, a uranium isotope with a mass one 
unit greater than U*** is formed: 


235 1 1236 
al yl J . 
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Fig. 246 


Fission occurs if the activation energy of U’** is less than the energy 
supplied by the neutron. The captured neutron brings to the nucleus 
an energy equal to the binding energy of the neutron plus the kinet- 
ic energy of its motion. In the case of U?**, the energy supplied by a 
neutron of thermal velocity is sufficient to produce fission. 

Since a large amount of energy is required to split a nucleus of 
U**, the nucleus of this isotope can be split only by means of fast 
neutrons. 

A nucleus of U*** generally divides into two unequal fragments. 
Nuclei split randomly and yield various primary products. Fig. 246 
shows the yield of uranium-235 fission products as a function 
of mass. It is seen from the curve that division into equal fragments 
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has a minimum probability. Most frequently the masses of the frag- 
ments stand in a ratio of about 2 to 3 (e.g., Sr°* and Xe’**). These 
nuclei have enormous energies—a light nucleus has an energy of about 
100 Mev and a heavy nucleus 65 Mev. The large variety of fission 
products is due not only to the fact that a nucleus may split in numer- 
ous ways, but also to the fact that a number of other products are 
created when radioactive fragment nuclei decay. 

Nuclear fission products have an abnormally large number of neu- 
trons. By means of a chain of transformations, radioactive nuclei as- 
sume a normal stable state. For example, 


137 137 
on 


187 137 
- gre ——> ,,Cs'?? ———>_ Bar”. 
30 sec 3.4 min 33 years 


The half-life is indicated under the arrows. It is evident from the val- 
ues of decay time that, practically speaking, only Cs’*’ is obtained 
as the product of this chain of transformations. 

Another very important fission product of uranium-235 is stron- 
tium-90 (see below). 

Nuclear fission of uranium-235 is accompanied by the release of 
an enormous amount of energy. Thus, 1 gram of uranium yields as 
much energy as the combustion of 2.5 tons of coal, i.e., 22,000 kwhr. 
Most of the energy is released in the form of kinetic energy of fis- 
sion fragments and about 10% is released in the form of radiation. 


223. Chain Reactions 


All aspects of the uranium-235 reaction have been studied in 
detail. Uranium-235 is the only natural isotope from which ener- 
gy can be obtained on an industrial scale. 

Since neutrons are required to achieve fission and a neutron gas 
does not exist in nature, a large amount of energy can be obtained 
only by means of a chain reaction in which new neutrons are contin- 
uously created. This is precisely the method used to achieve nuclear 
fission of uranium-235, Every time a U-235 nucleus is split sev- 
eral neutrons are liberated. The number of such neutrons varies 
from case to case, but on the average 2.5 neutrons per fission are ob- 
tained. If the neutrons formed when a nucleus splits are able to 
cause fission of other uranium atoms, a chain reaction may occur. 

Since neutrons have a long range, the neutrons formed upon fis- 
sion of a uranium-236 nucleus have a large probability of escaping 
from the substance without splitting other nuclei. Moreover, it 
should be realised that not every encounter between a neutron and 
a uranium-235 nucleus results in fission. 

The growth of a chain reaction is a function of the neutron mul- 
tiplication factor. The value of this quantity (K,) can be calculated 
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for the case of a system having infinite dimensions. To perform this 
calculation, onemust know the neutron multiplication due to fission 
by slow and fast neutrons and also the probability of neutron cap- 
ture by nuclei of nonfissionable materials. 

The extent to which K, exceeds unity is a measure of the increase 
in the number of neutrons of a given generation over the preceding 
generation. 

However, a reactor has finite dimensions. Therefore, the neutron 
multiplication factor must be written in the form K=K,(l1—p), where 
p is the probability that a neutron will escape from the reactor. 
In order for the reactor to operate, K, must be greater than unity. 
During reactor operation, the multiplication factor K must be exact- 
ly equal to unity. 

The dimensions of a system containing nuclear fuel are said to be 
critical when the multiplication factor of the system is equal to 
unity. 

Let us examine the factors which affect K. The probability of a neu- 
tron colliding with another nucleus before escaping from the sub- 
stance may be increased by amassing a large quantity of nuclear 
fuel. The same end is achieved by reducing to a minimum the 
number of atomic nuclei capable of absorbing neutrons, since such 
absorption removes neutrons from the reaction. The probability 
of capture may be increased by slowing down the neutrons. (When 
a nucleus splits, fast neutrons are formed, but a uranium-235 nu- 
cleus is best able to capture slow neutrons, i.e., so-called thermal 
neutrons.) 

Nuclei of uranium-235 may be split by fast neutrons as well as by 
slow ones, but the probability of being split by the former is less. If 
the nuclei of a substance may be split only by fast neutrons, it be- 
comes impossible to produce a chain reaction: aneutron created during 
fission of a nucleus and slowed down even slightly as the result of one 
or two chance encounters which do not result in fission will be lost 
to the chain reaction. 

At present, the most important nuclear fuels are able to produce a 
chain reaction with slow neutrons. Such fuels include one natural 
isotope (uranium-235) and two artificial elements, viz., plutonium- 
239 (obtained from uranium-238) and uranium-233 (obtained from 
thorium-232). 

The quantity of nuclear fuel in a particular volume must exceed 
a certain minimum in order for a chain reaction to begin. We need 
not be concerned about the first neutron since, thanks to cosmic radia- 
tion, a small number of neutrons are always present in the atmos- 
phere. Moreover, spontaneous fission, i.e., fission due to internal 
forces, also occurs occasionally. (This phenomenon was discovered 
by the Soviet scientists Flerov and Petrzhak. They found that it 
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is not always necessary for aneutron to be captured in order for fis- 
sion of a uranium-235 nucleus to occur.) Finally, a mixture of radium 
and beryllium may serve as a source of initial neutrons. 


224. Principle of Operation of a Nuclear Reactor 


If a chain reaction begins in a given mass of nuclear fuel and if 
the reaction is uncontrolled, the result will be an explosion since. at 
each instant the number of neutrons, and hence the quantity of released 
energy, will be greater than at a preceding instant. The quantity 
of energy released in a small fraction of a second will be so great that 
an explosion results. 

In order to release a constant or controlled amount of energy, one 
must build an installation in which the neutron multiplication fac- 
tor can be controlled. An installation which allows us to do this is 
called a nuclear reactor or pile. In such an installation, it must be 
possible to begin a chain reaction with amultiplication factor slight- 
ly greater than unity. Then, the concentration of neutrons inside 
the pile, and hence the power of the reactor, will begin to increase. 
Alter raising the power to the desired level, one must be able to set 
the multiplication factor exactly equal to unity. The reaction then 
becomes self-sustaining since the number of neutrons and the energy 
released per unit time remain constant. 

A reactor should be constructed in such a manner that the neutrons 
formed upon fission are utilised most effectively. But effective util- 
isation of neutrons does not mean that they must be used exclu- 
sively for the fission of nuclei and the release of energy. Substances the 
nuclei of which absorb neutrons may be introduced into the pile. 
By means of reactions with neutrons, we can obtain large quantities 
of useful artificial radioactive isotopes and, what is extremely im- 
portant, artificial nuclear fuel. Thus, a nuclear reactor may be used 
ee to produce energy, but to produce artificial isotopes as 
well. 

Neutrons formed upon fission have a velocity of tens of thousands 
of kilometres per second. The velocity of thermal neutrons is of the 
order of | km/sec (0.025 ev). Such slow neutrons are most effective in 
producing fission. 

Nuclear reactors operating with a natural or enriched mixture of 
uranium-235 and uranium-238 are of great importance. 

Resonance absorption of neutrons will occur in uranium-238. 
Strong absorption occurs, for example, at an energy of 7 ev. When 
a reactor is loaded with a mixture of isotopes, it is absolutely essen- 
tial to decelerate neutrons below this energy value. 

Thus, the basic elements of a reactor are fuel, a neutron modera- 
tor, aneutron absorber to control the multiplication factor, and shield- 
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ing to protect personnel from neutrons and y-radiation emitted dur- 
ing nuclear transformations occurring in the reactor. 

The small RFT uranium-graphite reactor of the U.S.S. R. Acad- 
emy of Sciences is a good example of a well-designed reactor. This 
reactor is used for various physical investigations and for the produc- 
tion of artificial isotopes. Its power is 10,000 kw and the flux of 
thermal neutrons at the centre of the reactor is equal to 8 x 10** neu- 
trons/em* sec. The core is in the form of acylinder the diameter and 
height of which are equal to | metre. The cooling water and the graph- 
ite serve as aneutron moderator. A number of horizontal and verti- 
cal holes, through which beams of neutrons emerge, pass through the 
reactor shielding. Materials to be irradiated are placed in the reactor 
core through special apertures provided for this purpose. 

A considerable number of nuclear reactors have already been put 
into operation or are in the design stage. Such reactors may differ 
from one another with respect to: 1) the type of fuel used (pure nu- 
clear fuel, enriched fuel, or natural uranium in the metal form or in 
the form of a chemical compound); 2) the fuel pattern (space lattice, 
rod lattice, or uniform distribution of the fuel in a solution or sus- 
pension); 3) the moderator (light or heavy water, graphite or beryl- 
lium); and 4) the type of cooling (water, gas or nocooling). Reactors 
may be designed to have any power ranging from a fraction of a kilo- 
watt to hundreds of thousands of kilowatts and may operate with 
slow (thermal) or fast neutrons, depending on the type and quantity 
of moderator used. 

Reactors are designed to be controlled completely automatically. 
Monitoring is achieved by means of neutron detectors located in the 
reactor walls. Such detectors are able to measure neutron fluxes rang- 
ing from 1 neutron/em? sec to 5X 10'° neutron/cm® sec. The neu- 
tron-sensitive material of a detector may consist of boron-10 or ura- 
nium-235 (fused to the electrodes of an ionisation chamber) or bor- 
on fluoride gas (i.e., the ionisation chamber is filled with BF, 
gas). In the former case, the chamber is filled with argon, nitrogen, 
helium or air at a pressure of 2 atmospheres. 

The operation of the ionisation-current amplifiers in the various 
relay mechanisms which transmit the neutron detector signal to the 
actuators controlling the motion of the control rods and safety rods 
must be extremely reliable. 

The position of the control rods varies during reactor operatien. 
This is because the quantity of material which absorbs neutrons in- 
creases as the decay products accumulate. Some of this “poisonous” ma- 
terial, e.g., gaseous products, may be removed from the pile automat- 
ically. Nevertheless, gradual withdrawal of the control rods is neces- 
sary to maintain the neutron density at a constant level. After a pe- 
riod of time of the order of 5-20 months the reactor becomes so 
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“poisoned” that further operation becomes impossible. The pile must 
be cleansed of decay products and reloaded with new fuel. Fission 
products which are strong absorbers of neutrons include ruthenium- 
103, xenon-131 and 135, neodymium-143, samarium-149 and 
151, europium-151, 152 and 155 and gadolinium-155. 

Of course, the formation of artificial fuel also causes the multipli- 
cation factor to change. 

If in addition to nuclear fuel there is a quantity of uranium-238 
or thorium in a reactor, such a reactor will not only liberate heat 
but will produce artificial nuclear fuel—plutonium from uranium- 
238 and uranium-233 from thorium. 

A reactor in which the quantity of fissionable material increases, 
or at least remains unchanged, is called a breeder. In constructing 
such a reactor, one must use a fissionable material which allows us 
to obtain an average number of neutrons per fission of more than two. 
The condition for fuel reproduction is that the number of atoms split- 
ting each second be no less than the number of uranium-238 or 
thorium atoms transformed each second into plutonium or uranium- 
233, respectively. 

In order to transform the liberated heat into electrical energy, 
one must proceed exactly in the same way as in the case of thermal 
electric power plants operating on coal. Heat extraction from a reac- 
tor is a very difficult engineering problem. However, this problem 
has been solved. The first atomic electric power station has been 
operating in the U.S.S.R. for several years and one has also been 
put into operation in Great Britain. During the next few years, a 
number of large atomic electric power stations will be constructed 
in the Soviet Union. 


225. Artificial Radioactive Products 


Nuclear reactors constitute the most prolific source of radioac- 
tive products. Every hour of reactor operation yields a definite quan- 
tity of products of nuclear fission. This process cannot be controlled 
so as to obtain desired products. “Common” fission products having 
a relatively large (sufficient for practical utilisation) decay 
period include: 


Kr’, Sr°?, Sr®’, | Falck Re: Xess, Cs18? and Bal*®. 


With the further development of nuclear engineering, the quan- 
tity of radioactive fission products (fragments) obtained in reactors 
will reach imposing values. A reactor having a power of | kw yields 
a quantity of products which radiates 10° curies in 100 days. At pres- 
ent, by no means all of them have practical applications. Of most 
jnterest are caesium-137 and strontium-90. These have a relatively 
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long lifetime and are obtained in reactors in relatively large quan- 
tities. The atomic industry produces them in pieces with an initial 
radioactivity of the order of 1,000-2,000 curies. This is equivalent 
to the radioactivity of 1-2 kg of radium. 

Radioactive fission products are used in research work, production 
(process and quality control), medical treatment, etc. Some radio- 
active fission products are used to sterilise food products, antibiot- 
ics, vaccines, etc. Such sterilisation requires a very large dose. This 
is obtained from radioactive preparations yielding a thousand curies 
over a very short period of time—of the order of minutes or hours. 
The radioactivation of irradiated objects is not dangerous since 
fission products radiate B- and y-rays and these do not produce appre- 
ciable radioactivity in an irradiated substance. If packaged goods or 
goods protected by anatural envelope, e.g., eggs or fruit, are subject- 
ed to y-ray sterilisation, the bacteria are destroyed and new bacte- 
ria cannot penetrate through the packing or envelope. Large-scale 
sterilisation by radioactive irradiation may replace refrigeration in 
many cases. 

An important field of application of radioactive isotopes is ra- 
diography. Cobalt-60* and caesium-137 replace X-ray machines in 
the flaw detection of metals. The use of y-rays instead of X-rays means 
that an X-ray tube and its high-voltage equipment are replaced by 
cheap fission products: the basic equipment of a radiographic labor- 
atory may be reduced to an ampule of radioactive matter and a 
fluorescent screen or photoplate. Radiography, moreover, has broad- 
er applications. For example, the inside of a pipe may be examined 
with y-rays, something that is physically impossible to do with X-rays 
because of the size of an X-ray tube. Artificial isotopes may replace 
radium salts used to paint the dials of clocks and other instruments. 

By means of strontium-90, air may be ionised to eliminate static 
electricity, the accumulation of which is undesirable in various in- 
dustries. It is useful to ionise the air in combustion engine chambers 
since this increases the rate of combustion. 

By means of strong f-radiators, one can construct low-power 
sources of electric current (less than 1 milliwatt). There are several 
ways of creating electrical devices by means of streams of electrons 
from B-radiators. One of the most promising methods is the irradia- 
tion of germanium or silicon with electrons. Layers possessing uni- 
directional electrical conductivity may be manufactured from these 
elements. The distinguishing feature of semiconductors of the gtr- 
manium or silicon type is that under the action of external electrons 
a large number of electrons are set free inside these substances. As 
a result of unidirectional displacement of the liberated electrons, a 


* Cobalt-60, which is not a fission product, will be discussed later. 
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potential difference of the order of 0.2-0.3 volt is created across a 
semiconductor pair. The efficiency of such an electrical device can 
be gleaned from the fact that one external electron produces up to 
200,000 internal electrons. Therefore, in spite of the fact that the 
efficiency in the utilisation of radioactive radiation is only 1%, a 
source of 50 millicuries with a half-life of 20 years (strontium-90) 
enables us to produce a low-power generator of electrical energy at 
an insignificant cost. 

A specific isotope may be obtained in two ways: by means of a nu- 
clear reactor or by nuclear bombardment in an accelerator. 

When we place a substance in a nuclear reactor, it is subjected to 
the action of neutrons. Since most elements easily react with neu- 
trons, it is possible to obtain radioactive isotopes of almost all the 
chemical elements. As a result, nuclear reactors constitute the main 
source of radioisotopes. 

Irradiation with neutrons is accomplished as follows. The substance 
to be irradiated is placed in an aluminium capsule the length of 
which is about 7cm, diameter about 2 cm, and wall thickness 0.1- 
0.2 mm. If the substance is volatile or may react with aluminium, 
it is placed in a quartz ampule, which is wrapped with quartz fibre 
before being inserted into the aluminium capsule serving to protect 
it mechanically. Ordinary glass cannot be used since it is a strong 
absorber of neutrons. If a large object is to be subjected to irradiation, 
a big aluminium box must be constructed forit. Reactors designed 
for the production of radioisotopes are provided with tunnels of 
various cross-sections, through which aluminium boxes and capsules 
may be introduced into the reactor to a great depth. After irradia- 
tion, the boxes and capsules are placed in lead containers. 

Radioisotopes are formed in a reactor as the result of neutron 
capture or the dislodgment from the bombarded nuclei of protons, 
and, less frequently, a-particles, by neutrons. At first the number of 
radioactive atoms in an irradiated substance increases uniformly with 
irradiation time. Then, the rate of increase in activity decreases 
until, finally, saturation sets in. For different substances, this proc- 
ess proceeds at different rates, depending on the half-life of the ele- 
ment. The rate of increase in activity is uniform as long as the half- 
life is 5-10 times greater than the irradiation time. Saturation sets 
in when the irradiation time becomes approximately 5-10 times 
greater than the half-life. 

It should be noted that a nuclear reactor cannot produce the var- 
iety of isotopes produced by a cyclotron. This is quite understand- 
able since in a reactor the conditions of bombardment of atomic 
nuclei are restricted by the nature of the bombarding particle, i.e., 
the neutron, and its range of energies. But there are even objections 
to the use of reactors to obtain large quantities of an isotope such as 
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C™. The initial material introduced into a reactor to obtain C™ is 
an excellent absorber of neutrons and therefore cannot be used in 
large quantities. Thus, accelerators have a definite role of their own 
to play in the production of radioisotopes. 

The effectiveness of an accelerator is determined by the energy 
and number of nuclei ejected from the accelerator per unit time. 
The number of nuclei can be easily calculated from the average ion 
current, which, in acyclotron, is equal to 107"* a. Knowing the charge 
of an ion, one can easily determine that 10° nuclei, or 2x 10-'* gm 
in the case of deuterium, are obtained each second. 

In addition to the above facts, one must also know the “effective 
cross-section” of the reaction in order to determine the rate at which 
a particular radioactive substance will be formed under the action 
of a beam of nuclei. This quantity is designated by o and its value is 
given in cm’. It has the following significance. If S is the area of 


a bombarded sample, — is the probability that the nuclear projectile 


will hit the target nucleus and produce the given reaction. The order 
of magnitude of o is usually about 10-**cm?. In the case of certain 
nuclei, the absorption of neutrons increases sharply at definite neu- 
tron velocities. For example, cadmium is a strong absorber of slow 
neutrons with an energy of 0.18 ev. The effective cross-section of 
this reaction is of the order of 7,000 x 10-** cm’. 

The cross-section of a given reaction depends to a great extent on 
the energy of the bombarding particle. If a sharp peak occurs in a 
curve of cross-section plotted against incident particle energy, the 
peak value is referred to as the resonance cross-section. 

It is evident from the values given above that the production of 
significant quantities of isotopes often requires many weeks of ac- 
celerator operation. 

The radioactive elements obtained from reactors and accelerators 
are widely employed as tracers (whence the designation “tagged 
atoms”) in practically all branches of science and engineering. The 
mass of material required for all such purposes is not very large, 
but since elaborate industrial facilities serve to produce only small 
amounts of radioactive isotopes this industry is rapidly expanding 
to meet the growing demands. 

The presence of a radioactive isotope in a substance is generally 
determined by means of a counter. In this manner, an activity of 
less than 10~** millicurie can be measured. Thus, a quantity Bf ra- 
dioactive phosphorus having a weight of less than 10~** gm can be 
detected. 

Widely used isotopes include the following: Cobalt-60, which is 
f~-radioactive with a half-life of 5.2 years; since it is a strong f-ra- 
diator it is widely used in gamma-raying and irradiation. Carbon-i4, 
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which has a half-life of 6,360 years; widely used in biochemistry, 
geochemistry, and in the study of the kinetics of chemical reactions. 
Phosphorus-32 and sulphur-35, which are B~-radioactive with a half- 
life of 14.3 and 87 days, respectively; one of their most important uses 
is in agriculture in the study of fertiliser assimilation by plants. 


226. Thermonuclear Reactions 


Theoretical calculations indicate that atomic nuclei of almost. 
all elements can serve, in principle, as sources of energy. It turns 
out that any nucleus heavier than the nucleus of a silver atom pos- 
sesses more energy than the components into which it may be divid- 
ed. All heavy nuclei release energy when they split up. The heavier 
the nucleus, the greater the magnitude of this energy. That is why 
uranium is the “best” nuclear fuel. 

However, light nuclei can also serve as sources of energy. Theo- 
retical calculations indicate that a nucleus obtained by the fusion 
of two light nuclei will possess less energy than the original parti- 
cles. Therefore, energy isreleased upon fusion of light nuclei. Here, 
too, the further away from the mid-point an element stands in the 
Mendeleyev periodic table, the greater the amount of energy re- 
leased in such a reaction. The greatest amount of energy is obtained 
from the fusion of nuclei of hydrogen atoms. 

What conditions are required to achieve fusion between light par- 
ticles? Nuclear bombardment cannot yield the desired result since 
a charged particle is decelerated rapidly in a substance. The only 
way to achieve fusion is to raise the temperature. It is easy to calcu- 
late the temperatures that atomic nuclei require to enable them 
to approach each other closely, i.e., to overcome electric repulsion. 

Calculations indicate that the temperature at the centre of the 
Sun is 20 million degrees. Knowing that the energy associated with 


each degree of freedom is equal toy kT, onecan determine the average 


kinetic energy of a particle at this fantastically high temperature. 
This energy is equal to merely 3,000 ev. Now, let us calculate from 


the formula for the potential energy of electrical interaction, u=s, 


how closely two protons will approach each other. It turns out that 
the distance will be equal to5 x 10~** cm. As we know, the radius of 
a nucleus is considerably less than this value. Nevertheless, thermo- 
nuclear reactions, i.e., reactions occurring at a high temperature, 
are possible in the Sun. Calculations which take into account the tun- 
nel effect, and the fact that in every gas, including a gas of nuclear 
particles, there are particles the velocities of which considerably ex- 
ceed the average, indicate that in a year one atom in a million 
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takes part in nuclear fusion. This small fraction is sufficient to 
account for solar activity. 

Under terrestrial conditions, such high temperatures have been 
repeatedly created in the United States and the Soviet Union during 
hydrogen bomb tests. Temperatures of scores of millions of degrees 
are created during uranium bomb explosions. If a substance whose 
nuclei are capable of combining and releasing energy is located 
within the zone of this explosion, a thermonuclear reaction, the 
energy of which is many times greater than that of a uranium 
bomb, will occur. The uranium bomb in this case serves to trigger 
the thermonuclear reaction. 

Here are examples of the most easily realised reactions which re- 
lease large quantities of energy: 


D?’ -+- H’ = He’; Li’ + H' = 2Het; 
H? + H = He’: C3 ae H'! =N". 


Thermonuclear reactions can occur only at temperatures which 
give nuclei a thermal velocity sufficient to overcome, with appre- 
ciable probability, the Coulomb potential barrier. 

Of greatest interest are the reactions in deuterium and in mixtures 
of deuterium and tritium since these require the least amount of 
energy. A temperature of 200,000° is required to obtain one neutron 
per second in a gram of deuterium (in accordance with the reaction 
,D’+ ,D? =,He*+,n’). In a highly rarefied gas, an even higher tem- 
perature (of the order of 500,000°) is required for the same purpose. 
At such a temperature, deuterium (like other substances) will form 
a plasma of nuclei and electrons. To transform deuterium to this 
state requires very little energy—of the order of several kilowatt-hours. 
However, the difficulty does not lie in transmitting this energy to 
deuterium, but rather in providing thermal confinement, i.e., the 
deuterons must preserve the corresponding kinetic energy for a long 
period of time. 

A possible solution to this problem was proposed by Sakharov and 
Tamm. They suggested that thermal confinement be achieved by 
‘utilising the electrodynamic forces which constrict parallel currents. 
When a voltage is applied across a plasma, it is constricted, i. e., 
a plasma column is formed (pinch effect). This constriction reaches 
its culmination when the temperature of the plasma increases to*a 
million degrees. At such instants, deuterium nuclei initiate a ther- 
monuclear reaction. This theory has been confirmed by Soviet scien- 
tists and the method may prove to be the most practical means of 
utilising the energy of thermonuclear reactions. Experiments have 
been conducted with electric discharges in tubes with discharge gaps 
of up to 2 metres, and currents of 100,000 to 2,000,000 amperes have 
been investigated. 


CHAPTER XXXII 


ATOMIC STRUCTURE OF BODIES 


227. Polycrystalline Substances and Monocrystals 


As a rule, crystals begin to grow around a very large number of 
centres in a melt or solution. If special measures are not adopted, a 
polycrystalline substance rather than a monocrystal will form as the 
result of crystallisation. Under a microscope such a substance seems 
to consist of individual grains (see Fig. 247). Each grain is a crystal 
which has an irregular haphazard form due to the fact that its normal 
growth has been impeded by neighbouring crystals. Most bodies com- 


monly encountered, particularly 


metals and rocks, are polycrystal- 


TR line substances. 
YY . The boundary between grains is 
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of a substance accumulate at the 
grain boundaries. The interlayer 
between crystals differs from the 
“body” of the grain not only in 
that it contains foreign atoms, but 
in that its atoms have a disturbed 
(transitional) arrangement. The bas- 
ic structure of the boundary be- 
tween grains is clearly visible under 
a microscope as peculiar, smooth 
“paths”. The usual size of grains 
in metals and rocks is 107*-10-* cm. 

A single crystal (monocrystal) of any crystalline substance may be 
found in nature or artificially grown. A monocrystal is distinguished . 
by its regular shape, i.e., plane faces, straight edges, and symmetry, 
in other words, the proportionality of its component parts. This reg- 
ular shape reflects a crystal’s internal properties, which enable us 
experimentally to distinguish a crystal from a bit of material giv- 
en such a shape artificially. It is also not difficult to recognise a crys- 
tal when its characteristic features are hidden. Thus, a sphere may 
be fashioned from a large crystal of rock salt but, when it is placed 
in water, surface material is dissolved at a nonuniform rate and, 
as this process goes on, its accidental shape tends to be transformed 


Fig. 247 
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into the polyhedral shape which is natural for this substance. A 
monocrystal can be easily distinguished from a polycrystal by means 
of X-ray analysis. 

A naturally formed crystal has the shape of a polyhedron. As in 
the case of every polyhedron, a crystal has a certain number of faces 
(p), edges (r) and corners (e), 


which are related to one another oe: 
as follows: p-+e=r+2. For J} SS 
example, a cube has 6 faces, 8 EF 
corners and 12 edges. m 

Crystal faces are arranged in cama 
bands or zones. A system of faces \VY A 
the intersections of which are 

aa 


parallel edges is called a zone, 
and the direction of these edges 
is called the axis of the zone. 
Crystals of one and the same 
substance may differ considerably 
with respect to shape, but it has 
been long known that a given 
substance has characteristic an- 
gles between faces and edges. 
(Depending on chance, one com- 
ponent of a crystal may grow 
more than another; as a result, 
apparently, the proportionality Fig. 248 
between components may be up- 
set.) This important rule, which may be called the law of constant 
angles, is illustrated in Fig. 248. In the figure, we see four different 
crystals of silicon dioxide. It is seen that the number of faces and 
their relative dimensions differ from specimen to specimen, but the 
angles between corresponding faces and edges remain unchanged. 





228. Space Lattice 


The distribution of matter in a crystal may be represented by a 
three-dimensional periodic function. This is the fundamental rule 
lying at the basis of crystal investigations. 

Fig. 249 shows a wall-paper pattern. Acertain element of this pat- 
tern is repeated in two directions. Consider any point A in the 
figure. A system of lines may be drawn through the selected points 
(nodes) as shown. A pattern element the repetition of which yields 
the full pattern is enclosed within a cell of the resulting grid. Evi- 
dently, the entire pattern can be obtained from asingle cell by means 
of parallel translations of the cell vectors a and 6. 
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A crystal constitutes a space lattice, not a plane lattice. An ele- 
ment of a crystal is a parallelepiped based on three translational 
vectors a, b, c, which may be selected, generally speaking, in an in- 
finite number of ways. Such a par- 
allelepiped will be called an ele- 
mentary or unit cell, the vectors a, 
b, c the basic translational vectors, 
or simply vectors, and their lengths 
a, b, cthe basic repetition periods 
or lattice spacings. The lattice is 
described in a system of coordinates 
the axes of which coincide with 
the directions of the basic vectors. 
Different ways of selecting basic 
vectors, i.e., an elementary cell, 
are illustrated for a two-dimen- 
sional case in Fig. 250. An elementa- 
ry cell in the general case is an 
oblique-angled parallelepiped with 


edges a, b, cand anglesa=06,c, B= 


=C,a, y=a, b. The six quantities 
which uniquely describe an elemen- 
tary cell are called its parameters. 
Since the entire lattice is deter- 
mined when an elementary cell is 
given, the above quantities are 
sometimes called the parameters 
of the lattice. 

A cell in the form of an oblique-angled parallelepiped is said to 
be triclinic and ifa =y=90°, monoclinic. A cell in the form of a right- 
angular parallelepiped is said to be rhombic and if in additiona—8, 
tetrahedral. If a=b#c, a=~=90°, and y=120°, a cell is said to be 
hexagonal. The simplest cells have the form of a cube. 

If one of the lattice points is selected as the origin of the coordi- 
nate system, the radius vector of any other lattice point is given by 
the formula 





Ranp= ma + nb + pe, 


where m, n, p are whole numbers representing the coordinates of 
these nodes. The indicated numbers are called the indexes of the 
nodes. The set of three indexes describing lattice points is designated 
by the nodal symbol [[mnp]]. 

There are an infinite number of nodal lines and nodal planes. 
Nodal lines and nodal planes are represented in a lattice by infinite 
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families of parallels. The transition from one line to another of the 
same family, or from one plane to another, occurs by translation along 
a vector joining two nodes of these lines or planes. Each family 
of nodal lines is described by the lattice spacing along a nodal line 
and the direction, i.e., incline, to the selected coordinate axes. 
To describe a family, we select the line passing through the origin 
of the coordinate system. A nodal line is described uniquely by the 
indexes u, v, w, of the first lattice point lying on this line. The indexes 
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Fig. 250 


of this lattice point are called the indexes of the line and are desig- 
nated by [uw]. If an index is negative, a minus sign is placed above 
the numeral. The symbol [100] represents the a-axis of the lattice, 
[010] the b-axis, and [001] the c-axis. The lines [011] and [011] 
represent plane diagonals in the face bc. Of course, [011] and [011] 
are one and the same line. Distinguishing between these two designa- 
tions has significance only if we wish to emphasise the polarity of 
the direction. The spatial diagonals of a cell have the symbols [111], 
(111], (111) and [111]. There are four of them, corresponding to the 
existence of eight quadrants; the other four symbols represent the 
same lines, but with reverse polarity. Thus, [111] is anti-parallel 
to [111], etc. 

A space lattice can be constructed as follows. First, an infimite 
plane-lattice (nodal plane) is formed by means of two translational 
vectors; then, a space lattice is formed by means of a third transla- 
tional vector which does not lie in this plane. A crystal lattice can 
be represented by families of nodal planes in an infinite number of 
ways. Every family of nodal planes consists of parallel planes sepa- 
rated from one another by equal distances. For a given lattice, spec- 
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ification of the interplanar distance and the orientation of one of 
the planes relative to the selected coordinate axes completely de- 
scribes a family of nodal planes. It is also sufficient to give the orien- 
tation relative to the selected axes of the plane closest to the origin. 
The distance of this plane from the origin will be equal to the inter- 
planar distance of the given family. 

Let this plane intersect the lattice axes at the coordinates = ‘ a 
and 5, i.e., fractions of the basic lattice spacings. The numbers 4, 


k and J, which describe the orientation of the plane, will be called the 
indexes of the plane. It is easily seen that A, k and / are wholenumbers. 
One way of showing this is as follows. Consider a plane passing through 
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Fig. 251 


an initial lattice point and another plane, of the same family, 
displaced by an amount a. This is shown in Fig. 251. Other planes 
will pass through these nodal planes, but they must be separated from 
one another by equal distances. Therefore, the repetition periods along 
the selected axes will be divided by the nodal planes into a num- 
ber of equal parts. 

1 


The plane closest to the origin and intersecting the axes at a a 


and + of the lattice spacings is described by the set of three indexes h, 


k and 1. Its symbol is designated by enclosing these indexes between 
round brackets: (Akl). For example, the plane (236) intersects the 


axes at the coordinates a = and + . Any plane which intersects 
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the axes at coordinates which are a multiple of these values is a 
member of this family. Thus, in the case under consideration, the suc- 
cessive planes beyond the one closest to the origin will intersect the 
axes at the following coordinates: a, 2, ae 3a, b, SS etc. 

If a plane intersects the axes at negative coordinates, this is indi- 
cated by a minus sign above the corresponding index. It is evident 
that the planes (Akl) and (ARI) 
belong to the same family. 
Therefore, all the signs of the 
indexes of a plane may be re- 
versed. 

If a plane is parallel to a 
coordinate .axis, the corre- 
sponding index is equal to 
zero. Thus, (110) is a plane 
that is parallel to the c-axis, 
(001) is the lattice plane ab 
(see Fig. 252), (010) is the 
plane ac, and (100) is the 
plane bc. Planes passing 
through one of the axes and 
one of the diagonals have in- 
dexes consisting of two ones 
and one zero. For example, the Fig. 252 
plane (101) is a plane which 
is parallel to the b-axis and passes through the diagonal extending 
from the terminal of vector +a@ to that of vector -+c (not the diago- 
nal passing through the origin). The plane (101), which passes 
through the terminals of the vectors —a and —c, belongs to the same 
family. The plane (101) and its “reverse side” (101) are also parallel 
to the b-axis and pass through the diagonals ac which do not begin 
from the zero lattice point, but extend from the terminal of vector 
+a to that of vector —e and the terminal of vector —a to that of 
vector --e, respectively. 

A symbol consisting of three units refers to planes passing through 
three diagonals. These planes pass through the terminals of all three 
lattice vectors. Thus, the plane (111) passes through the terminals 
of the vectors —a, —b and +e. 

The indexes of a family of nodal planes are at the same time the 
indexes of crystal faces. Two parallel faces have the indexes (Ak/) 


and (Akd). 
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229. Cell Selection and Crystal Symmetry 


The vectors a, Bb and ¢ of a lattice may be selected in a variety of 
ways. If there are no lattice points within an elementary cell, we 
call it a primitive cell. 

Various ways of selecting a primitive elementary cell are shown 
in Fig. 253. In view of the periodicity of a space lattice, the magni- 
tude of the volume associated with each lattice point is a constant. 
Its value is equal to the volume of a primitive elementary cell, re- 
gardless of the manner in which such a cell is selected. Since each of 





Fig. 253 


the eight lattice points at the corners of such a cell is “shared” by eight 
cells, — of the lattice points belongs to the given cell. Thus, on the 


average, there is one lattice point per cell. 

In a number of cases, it is expedient to select an elementary cell 
so that its volume is greater than that of a primitive cell. Thus, in 
order to take maximum advantage of the symmetry of a crystal, we 
often select an elementary cell with an additional lattice point at face 
centres or at the centre of the cell. Three cases are encountered fre- 
quently: 

1) Body-centred cell. An additional lattice point is located at the 
intersection of the spatial diagonals of the cell. In this case, there 


are two lattice points per cell: [[000]] and [sol |. The lattice 
point at the centre of a cell belongs entirely to the given cell. The 


eight lattice points at the corners are shared jointly by eight cells, 
ie., - of each of these lattice points belongs to the given cell. 


2) Face-centred cell. An additional lattice point is located at the 
centres of a pair of faces, e.g., ab. In this case, too, there are two lat- 


tice points per cell: [[000]] and [lel |- 
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3) All-sided face-centred cell. An additional lattice point is lo- 
cated at all face centres. In this case, there are four lattice points 


per cell: [{000]], [[ossl |. [[=os]| and [les ol]. 


The following designations are commonly used: P—primitive 
cell; A, B and C—face-centred cells with a lattice point in faces 
be, ac and ab, respectively; F—all-sided face-centred cell, and /— 
body-centred cell. 

As emphasised earlier, a lattice point is an arbitrary point, but for 
convenience is selected in a specific manner. The succeeding lattice 
point is separated from the selected one by a distance equal to the 
ace spacing. Thus, there is one lattice point per primitive 
cell. 

All the atoms of a primitive cell may be replaced by lattice points. 
Usually, a point of intersection of symmetry elements is taken as a 
lattice point. 

A primitive cell may consist of many atoms. When there are many 
atoms in acell, the structure is described by the coordinates of the 
atoms in an elementary cell. 

The model of a crystal as a space lattice is in full accord with 
experimental data. Crystal edges and faces correspond to nodal 
straight lines and planes. The angles between crystal faces and edges 
are the same for all crystalline objects of a given chemical com- 
pound. 

The symmetrical features of crystal structure may also be deter- 
mined from the space lattice model. 

Crystals of different substances have different symmetry. If a 
crystal is well formed, its symmetry is self-evident. It is clear that 
planes and axes of symmetry 
may be passed through a crystal 
in a specific manner. 

The external symmetry of a 
crystal can be explained by its 
internal structure, i.e, the 
symmetry of the space lattice. 

In addition to axes of sym- 
metry, symmetry elements en- 
countered in crystals include mir- 
ror planes and inversion or sym- 
metry centres. Fig. 254 illustrates 
the operations which may be performed by means of these sym- 
metry elements. 

It has been long known that axes of symmetry of fifth order and 
axes of symmetry of higher order than the sixth do not occur in 
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crystals.* Basing himself on this fact, A. V. Gadolin proved that 
there can be only 32 groups of symmetry among crystals. 

Before the development of the space lattice theory, it was unclear 
why axes of fifth, seventh, etc., orders were not encountered in crys- 
tals. This and other features of crystal symmetry can be explained 
by the space lattice theory. 

Let us consider rotation of a lattice plane. Rotations which are 
opi eels for a plane willcertainly not be possible for the entire 
attice. 

Assume that an axis of n-th order passes through the lattice point 
B and that the identical one closest to it passes through the lattice 
point A (see Fig. 255). 
Rotation about axis 
B transfers lattice 
point A to A’ and ro- 
tation about axis A 
transfers lattice point 
B to B’. It is seen 
from the diagram that 
B’A' =AB (1+2cosa). 

Fig. 255 But the — distance 

A'B’ must be a 

multiple of the lattice spacing AB, for A’B’ is parallel to AB. 
Therefore, 2 cosa must be equal to a whole number. It fol- 


lows that cos a can assume only the values 0, 45 and +1, anda 


the values 60°, 90°, 120°, 180° and 360°. This also follows from the 
definition of a closed syminetric operation: the rotation angles are 
equal to 360° divided by a whole number. Thus, it is possible to have 
rotational axes of symmetry of sixth, fourth, third, second and first 
orders in a crystal. 

One can prove easily that an axis of symmetry is a nodal line 
which is normal to a nodal plane. 

The symmetry of a crystal is determined by the symmetry of the 
space lattice. But it should be realised that the symmetry of a lat- 
tice is considerably richer. Only 32 groups of crystal symmetry exist 
but, as was first shown by E. S. Fyodorov, the founder of structural 
crystallography, a space lattice has 230 types of symmetry (Fyodo- 
Tov groups). 





* If a body is turned about a certain axis by an angle == so that the figure - 


obtained coincides with the original figure, such an axis is called an axis of 
symmetry ‘of n-th order. For example, a 3-faced prism based on an isosceles 
triangle has an axis of symmetry of third order which passes through the centre 
of the triangle and is parallel to the edges of the prism. 
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The richer symmetry of a lattice is due to the fact that in addition 
to closed symmetric operations it includes a symmetry element which 
is not possible for a body, i.e., translation. A symmetric operation 
consists in the displacement of a body to a position which is indis- 
tinguishable from its original position. Therefore, in the case of an 





Fig. 256 


infinite lattice, a displacement along one or another nodal line is 
a symmetric operation. 

Translation introduces the following new symmetry elements: 
1) a combination of rotation and translation—screw axes; and 2) a 
combination of reflection and translation—slip planes. A screw axis 
of fourth order is shown in Fig. 256 (right diagram). Each of points 
2, 3, 4 and 5 is obtained from the preceding one by a 90° turn and 


displacement along the axis by + of a period (f). In Fig. 256 (left 
diagram) we see triangles reflected in a plane Q and slipped along 
AA’ by + of a period. 


230. The Packing of Particles in a Crystal 


Figures having a definite volume and shape can be stacked or 
packed. It is by no means clear to what extent a formation of atoms 
will conform to this picture. Here, the answer to the following ques- 
tion is of prime importance: If we attribute a definite shape to an 
atom or group of atoms, will this even roughly correspond to a mini- 
mum on the potential curve of particle interaction in a given direction? 
Moreover, to what extent will the volume attributed to the atom or 
group of atoms encompass all the electrons, including the valence 
electrons, belonging to this atom or group of atoms? If it turns out 
that describing the limits of an atom or molecule by a definite con- 
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tour has physical meaning, we will have ascertained at the same time 
how this shape is manifested when a crystal is formed from particles. 

The nature of interactions between atoms in a crystal is infinitely 
varied. However, several limiting cases can be singled out: pure 
ionic bonds, homopolar bonds, metallic bonds and molecular bonds. 
As examples, let us consider the structures of rock salt, zinc sulphide, 
iron and the organic substance diketopiperazine (see Fig. 257). 


= 


Period 0 
e,/ O 





Diketopiperazine 
Fig. 257 


In the first three cases, the absence of a tight group of atoms is 
characteristic. A molecule cannot be distinguished in a crystal of 
rock salt. Each atom of sodium has six perfectly equal chlorine atoms 
as neighbours. Nor can a molecule be distinguished in the other two 
compounds, which are examples of homopolar and metallic bonds. 

In ionic crystals, the particles consist of positive and negative 
ions, which attract one another in accordance with the laws of elec- 
trostatics. Everything said about ionic bonds in molecules applies 
to crystals. The nature of ionic structures is communicated reason- 
ably well if the ions are represented by spheres having definite 
ionic radii. The values of ionic radii in ionic molecules differ little 
from those in crystals. 
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In crystals with a homopolar bond, every two atoms are linked 
by a pair of electrons. In this manner each zinc atom is linked with 
its surrounding sulphur atoms, and each sulphur atom with its sur- 
rounding zinc atoms. If a sign of a homopolar bond were considered 
to be an indication of a molecule, it could be said that an entire crys- 
tal constituted a single molecule. It is physically meaningless to 
consider homopolar crystals as constructed of contiguous spheres. 
The dimensions of free atoms of sulphur and zinc are considerably 
greater than the distance between them in zinc sulphide. A homopo- 
lar bond brings the atoms into close contact and makes regions in 
which electrons of these atoms are located common regions. If 
the structure of zinc sulphide consisted of contiguous spheres, 
a large portion of the electron cloud would be located outside the 
spheres, i.e., only 25 per cent of the volume would be occupied by 
spheres. : 

Metallic bonds will be discussed in Chapter XX XVII. Here, a few 
words on this subject will suffice. In metals and alloys, outer elec- 
trons arecommon, forming an electron “gas”. The lattice of a metal- 
lic compound consists of atomic residues (positive ions) “cemented” 
by electrons. Here, too, it is physically meaningless to represent the 
structure as contiguous spheres, in spite of the fact that formally the 
structures of certain metals can be represented as closely packed 
spheres. 

In crystals of the same type as diketopiperazine the molecules are 
distinct. They can be easily recognised since the intermolecular dis- 
tances are considerably greater than the intramolecular distances. 
By studying the arrangement of molecules in crystals, crystallo- 
graphers have been able to pick out intermolecular radii of portions 
of spherical surfaces describing the limits of a molecule. The model 
of a molecule formed of microscopic spheres of intermolecular radius 
is based on an analysis of crystal structures. Such a geometric repre- 
sentation of crystals formed of molecules is quite justified since most 
of a molecule’s electron cloud is contained within the contour of the 
molecule. 

It must be concluded that the representation of the component 
particles of a crystal as geometric figures is meaningful in two cases: 
in ionic and in molecular crystals. On the other hand, such a repre- 
sentation is meaningless in the case of homopolar crystals and metal- 
lic compounds. 

Now, the following question arises: How is the shape of ions and 
molecules manifested in the formation of a crystal? The answer is 
that it is manifested in the compact packing of particles. Experi- 
ments indicate that molecules are always packed in such a way that 
a “projection” of onemolecule fits into a “depression” of another. There 
is a clear tendency for the molecules to become so oriented, with 
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respect to one another that the volume of an elementary cell is 
as small as possible. The situation is similar in the case of ionic crys- 
tals. The stacking of spheres occurs in such a manner that the large 
spheres fit closely together, while the small spheres (ions) fit into 
the empty spaces of the basic structure. 

In representing ions by spheres, and molecules by spatial figures, 
ve find that the “empty” space is equal to 25-35 per cent of the to- 
al. 

Close packing in molecular and ionic crystals provides basic proof 
a shape and volume are meaningful attributes of atoms and mole- 
cules. 


231. Molecular Crystals 


The assertion that molecules are bound by intermolecular forces 
in a molecular crystal is, of course, pure tautology. What then can 
be said about this concept and about the nature of intermolecular 
forces? 

The forces of attraction acting between molecules are interaction 
forces of electrical origin. A molecule possesses electrical properties 
even though it is electrically neutral, i.e., its total charge is equal 
to zero. 

In order to take these properties into account, all we need 
do is consider a molecule to be a dipole. As we know (see p. 249), 
dipoles attract each other. If all the molecules of a substance are 
polar, i.e., have a constant dipole moment, inherent dipole interac- 
tion exists between them. If a portion of the molecules are polar 
(the substance consists of several kinds of molecules), such mole- 
cules may induce dipole moments in neighbouring molecules. This 
is called induced interaction. 

Finally, it may be said that dipole interaction always exists be- 
tween momentary dipoles. (Every molecule is a momentary dipole 
at every instant of time, since the centre of gravity of electrons 
never coincides with that of nuclei.) 

It is not surprising that the potential energy of interaction is 
represented by a very complex function. The calculation of the de- 
pendence of the potential energy of interaction on the mutual orien- 
tation and spacing of molecules in a crystal is hardly possible. The 
potential of the attractive forces can be calculated only when the 
distances between molecules are large. When these distances are 
much greater than the dimensions of the molecule, the forces of in- 
teraction are inversely proportional to r’. This is not the case in 
crystals and liquids. 

The enumerated forces of attraction are counterbalanced by the 
forces of repulsion between the electron clouds of neighbouring 
molecules. The Pauli exclusion principle does not permit the 
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interpenetration of clouds, so the forces of attraction bring the 
molecules into “contiguity”. . 

Although the binding energies of different molecular crystals may 
differ considerably, intermolecular distances in no way depend on 
molecular polarity, binding energy, etc. Evidently, the separation 
between molecules is determined mainly by the well-defined boun- 
daries enclosing the electron cloud of a molecule. 

The shape of a molecule determines the nature of the molecular 
packing as well as the intermolecular distances. Molecular polarity 





Fig. 258 


and other characteristics of the forces of attraction not only do not 
affect the intermolecular distances, but also do not disturb the ten- 
dency to compact packing of molecules. Thus, in practically all 
cases, minimum energy is achieved by compact packing of mole- 
cules. Since molecular crystals strictly obey the principle of compact 
packing, only an exceedingly small variety of structural types and 
symmetry are encountered among such crystals. 

It is convenient to view the packing of molecules in crystals as 
a tight packing of compact layers. Two types of such layers are en- 
countered. These are illustrated in Fig. 258. The one with right- 
angled cells has greater symmetry. In this case (more than 90 per cent 
of organic crystals are formed of such layers), the molecules are 
stacked in the characteristic zig-zag manner shown in the figure. The 
rows of molecules forming a layer are connected by a screw axis of 
the second order (2,). This means that one row of molecules can be 
transformed into an adjacent one by a 180° turn and a displacement 
of half a period along the axis. 

In compact layers, each molecule has six close neighbours. When 
the layers are stacked, a molecule usually obtains six additional 
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close neighbours—3 above and 3 below. Thus, the total number of 
such neighbours becomes equal to 12. 

Crystals having a high order of symmetry are rarely encountered 
in the world of molecular crystals. It is not possible to pack com- 
pactly unsymmetrical molecules in symmetrical crystals. 

If a molecule possesses symmetry, this does not mean that the 
crystal also has such symmetry. A molecule of naphthalene has a 
high order of symmetry; three mutually perpendicular planes of 

J mirror symmetry may _ be 

ce drawn poues it Ss bie 

—Ht- a. 259). If these symmetry ele- 

: Ce \ ments were preserved in a 

formation of molecules, the 

wee: packing would be insufficient- 

ly compact. Therefore, the 

symmetry elements of a mol- 

Fig. 259 ecule which prevent greater 

compactness are “lost” in the 

formation of a crystal. The preservation of a centre of inversion 

is possible without sacrificing compactness of molecular packing. 

A crystal formed of molecules possessing this symmetry element 

usually does not preserve other symmetry elements of the molecules, 
but does preserve a centre of inversion. 

In other cases as well, the outcome of the tendency to symmetry 
as opposed to the tendency to compactness can bereliably predicted. 

An example of a packing arrangement typical of molecular crys- 
tals is provided by diketopiperazine (see Fig. 257). The molecules 
have a high order of symmetry, but the crystal preserves only a cen- 
tre of inversion. A molecule, of course, does not cease to be highly 
symmetrical simply because a crystal of such molecules does not 
possess its symmetry elements. — 


232. Compact Packing of Spheres 


A very important class of ionic crystals may be represented by a 
compact packing of spheres. 

Most anions are larger than cations. In such cases, crystals consti- 
tute a compact packing of anion spheres between which cations are 
located. This is how silicates, one of the largest groups of natural 
inorganic substances, are formed. In silicates, the cations are located 
in the empty spaces of a compact structure of oxygen anions. 

Let us examine the laws of compact packing of spheres, i.e., the 
fundamental structures of a great number of crystals. The only pos- 
sible arrangement of a compact layer of spheres is shown in Fig. 260. 
Each sphere has six neighbours. To form a compact packing arrange- 


Sec. 232] : Compact Packing of Spheres 585 


ment, one must place the spheres of a second layer in the spaces of 
the layer below it. It is not possible to fill all the spaces with spheres 
of the same size: every second space in the figure is filled (crosses de- 
note the spaces of the first layer which are filled by spheres of the sec- 
ond, and dots denote the spaces which remain vacant). 

There is also only one possible arrangement for compact packing 
of two layers of spheres. For three layers, however, the situation is 
different. In order to achieve com- 
pact packing in this case, one must 
place the spheres of the third layer 
in the spaces of the second, but 
this can be done in two ways: the 
centres of the spheres of the third 
layer may be placed either above 
the centres of the spheres of the 
first layer or above the spaces 
denoted by dots. Both three-layer 
structures are packed equally 
compactly; however, they differ 
significantly from one another. 
When a fourth layer is added to the 
structure, the number of possi- 
ble packing arrangements becomes 
even greater: from the two three-layer structures, one can make four 
four-layer structures. In the case of a five-layer structure, there are 
five possible arrangements, etc. It is evident that the number of dif- 
ferent arrangements of spheres packed equally compactly increases 
very rapidly with increasing number of layers. 

Now, let us compare a crystal lattice with such an arrangement 
of spheres. A crystal may be represented as a structure of spherical 
atoms in which the arrangement of layers is repeated exactly after 
a certain number of layers. If such a sequence begins with the four- 
teenth layer, for example, this means that a cell is thirteen layers 
high. In such a case, the fourteenth layer is above the first, the 
fifteenth above the second, the sixteenth above the third, etc. 

The simplest packing arrangement consists of two layers: the 
third layer lies above the first, the fourth layer above the second, 
etc. (see Fig. 261, right). This is a hexagonal compact packing arrange- 
ment. A cell of such a crystal is shown in the lower right-hand cor- 
ner of the figure. The dots and crosses denote the locations of the 
centres of the spheres. 

Three-layer crystals, in which the fourth layer is a repetition of 
the first, the fifth of the second, etc., are very common (see Fig. 261, 
left). In the lower left-hand corner of the figure, where only the 
centres of atoms are indicated, we see that a cubic elementary cell, 
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centred in all faces, may be selected. Here, the compact layers are 
arranged perpendicular to the spatial diagonal of a cube. Such a 

. structure is called a 
cubic compact-pack- 
ing arrangement. 

Empty spaces re- 
main in a packing of 
spheres of equal size. 
It can be easily calcu- 
lated that the volume 
of such spaces is equal 
to about '/, of the 
overall volume. There 
are two kinds of 
such empty spaces: 
one is surrourided by 
four spheres the cen- 
tres of which are locat- 
ed at the vertexes of 
a regular tetrahedron 
(see Fig. 262a); the 
other is surrounded 
by six spheres—the 

Fig. 261 centres of these spheres 

form a_ regular oc- 

tahedron (see Fig. 2626). The first kind is smaller in size, but there 
are twice as many of them as the second. 

It can be shown that in any compact arrangement of equal spheres 
there are two small empty spaces and one large one per sphere. Small 
spheres can fit into these spaces; but if they are somewhat too large, 
they cause the large neigh- 
bouring spheres to move apart, 
loosening the compact packing 
arrangement. 

Since different packing arrange- 
ments are possible with equal 
numbers of spheres, and small 
spheres may fill the empty spaces 
in different ways, ionic crys- 
tals have a great variety of Fig. 262 
structures. 

In crystals of common salt, a compact three-layer structure is formed 
by large chlorine ions (light spheres in Fig. 257), and sodium ions 
(dark spheres) fill all the large spaces; hence, every sodium atom is 
surrounded by six chlorine ions. In iron disulphide (pyrite), a com- 
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pact two-layer structure is formed by large sulphur-ions; iron ions 
fill all the large spaces. In a crystal of lithium oxide, the chemical 
formula of which shows that there are two lithium atoms for every 
oxygen atom, the compact structure is formed by large oxygen 
ions. Since lithium ions fill all the small spaces, each lithium ion 
has four neighbours (oxygen ions). In a crystal of cadmium chloride, 
the chemical formula of which shows that there are two chlorine 
atoms for every cadmium atom, the compact structure is formed 
by large chlorine ions; cadmium ions fill large spaces but not all, 
i.e., they fill the large spaces of every third layer of chlorine ions. 
We have presented, of course, only the simplest “patterns” of the fill- 
ing of empty spaces in compact packing arrangements. 


233. Examples of Crystal Structures 


The largest group of crystals consists of bodies formed of mole- 
cules. Ionic compounds also constitute a fairly large group. As indi- 
cated already, the representation of a crystal in these cases as closely 
packed particles is entirely justified. However, it is necessary to exam- 
ine those structures in which the direction of the bonds between 
atoms, the deviation of the electron cloud from spherical symmetry, 
etc., are the cause of structural arrangements which cannot be rep- 
resented so simply. Such exceptions include structures of atoms 
bound by common electrons. 

Most metals have structures consisting of body-centred cubic 
cells. In such crystals, each atom has eight neighbours rather 
than twelve, as is the case in a compact packing of spheres. This is 
the case, for example, for atoms of iron (see Fig. 257). The lattice 
of iron is cubic; iron atoms are located at the corners and centres of 
cubes. Lithium, potassium, caesium and a number of other sub- 
stances possess such a structure. 

In Fig. 263, the structure of crystalline mercury is compared with 
an ideally cubic compact packing arrangement. It can be seen that 
the nature of the arrangement of atom centres is the same in both 
cases, but in the former the distance between layers is less, and the 
distance between atoms in a layer is more than in the latter. 
This is analogous to a compact packing of slightly flattened spheres. 

Many examples exist of such more or less “damaged” compact 
packing structures. In the case of ice (see Fig. 264), all resemblance 
to a spherical packing arrangement is lost. The link between each 
pair of oxygen atoms is implemented by a hydrogen atom. In these 
four bonds, there are two oxygen atoms per hydrogen atom. (The 
structure shown in Fig. 264 does not, of course, contradict the chem- 
ical formula for water.) For purposes of clarity, a “hydrogen” 
bond is shown in the figure as a “neck”. The structure of ice is very 
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loose, as indicated by the large “holes”. If one projects the structure 
above the plane of the figure, these holes are transformed into broad 
channels which ‘pass through the structure. The structure of ice is 
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Fig. 263 





an important exception to the general rule. This does not mean that 
the cases in which the likening of a crystal to a compact packing of 





Fig. 264 


particles is meaningless are rare. 
As indicated above, crystals 
formed of atoms bound by com- 
mon electrons cannot be likened 
to a compact packing of spheres. 
The structure of zinc sulphide, 


) which is illustrated in Fig. 


257, is quite typical. Moreover, 
several elements reveal a simi- 
lar structure. These include car- 
bon (diamond), silicon, germa- 
nium and tin (white). 

Homopolar bonds can form 
layers and chains of atoms. 

Fig. 265 shows the structure 
of graphite. The carbon atoms in 
graphite form a layered struc- 


ture, but these layers are not the same as those of a compact packing 
arrangement. It is not possible to form a layer of graphite of 
contiguous spheres. In graphites, layers of strongly bound atoms 
constitute planes. Arsenic and phosphorus also form layered struc- 
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tures in this sense, but the atoms of their layers are not arranged 
in a single plane. An example of a structure consisting of chains 
of strongly bound atoms is provided by gray selenium. Each atom 
of this substance is strongly bound to only two neighbours. In 
gray selenium, the atoms form an endless spiral about a straight 
line. The separation between atoms of neighbouring spirals is 
considerably greater than thesepara- 
tion between close atoms in a giv- 
en spiral. 

The black, lustreless, soft graph- 
ite used in pencils and the shiny, 
transparent, hard diamond which 
can cut glass are composed of the 
same kind of atoms, i.e., carbon. 
This is a very striking example 
of how greatly the properties of 
a crystal are affected by the ar- 


rangement of its atoms. Refractory  AUlaEe jie 
crucibles capable of withstanding ALL BELLI A 


Mall OL 
temperatures of 2,000-3,000° are 


made of graphite, while at temper- Fig. 265 

atures exceeding 700° diamonds 

are burned up; graphite has a specific weight of 2.1 as compared 
with 3.5 for diamond; graphite conducts electric current, but dia- 
mond does not; etc. 

The ability to form different crystals is not a property of carbon 
alone. Almost every chemical element in the crystal state and every 
substance has several forms. Six forms of ice are known, nine of sul- 
phur, and four of iron. 

At room temperature, atoms of iron form cubic lattices, the atoms 
being located at the corners and centres of cubes. Thus, each atom 
has eight neighbours. At high temperatures, iron atoms form a com- 
pact structure. Here, each atom has twelve neighbours. Iron possess- 
ing eight neighbours is soft, while iron possessing twelve is hard. 
The quenching of steel fixes, at room temperature, a compact cubic 
structure which is stable at higher temperatures. 

We have seen in the cases of carbon and iron that the structures 
of different crystals of one and the same substance may differ consid- 
erably from one another. The same holds true for other substances. 

Thus, for example, in a crystal, yellow sulphur forms corrugated 
rings of eight atoms each. Each ring constitutes a sulphur molecule. 
Red sulphur also consists of such rings but they are turned completely 
differently with respect to one another. 

Yellow phosphorus atomsformcubic structures withe ight close neigh- 
bours. Black phosphorus has a Jayered structure similar to graphite. 










> 
BT VY 


; Wp ty Yl LE 


1e™:7 UY, AG, WH YYfe 
LILLE 





600 Atomic Structure of Bodies (Chap. XXXII 





Gray tin has a structure similar to that of a diamond. Theoreti- 
cally, white tin could be obtained from gray tin by strongly compress- 
ing the diamond-like structure along the axis of a cube. As a result 
of this flattening process, a tin atom would havesix close neighbours 
instead of four. 

Organic substances also frequently have a variety of crystal forms. 
The very same molecules are arranged differently with respect to 
one another. : 


234. Thermal Vibrations in a Crystal 


From the viewpoint of energy, an ideal crystal is, in a way, the 
antithesis of an ideal gas. 

In an ideal gas, the interaction energy of particles is much less 
than the average energy of thermal motion kT. On the other hand, 
since strong coupling exists between particles in a crystal, the inter- 
action energy is much greater than kT. Therefore, thermal motion 
in crystals cannot disrupt the coupling between atoms, but merely 
results in small vibrations of the atoms about equilibrium positions. 

In a crystal, every atom vibrates about an equilibrium position. 
For most crystals, the vibration amplitudes are of the order of 
0.14, i.e., a small fraction of the distance between close atoms, 
which, as we know, is of the order of 1.5-2A. 

The nature of this vibration may be very complex. During a vi- 
bration period, an atom describes a complex curve about its equilib- 
rium position. This is due to the fact that an atom is bound to its 
neighbours by different forces; hence, its vibrations are of an ani- 
sotropic nature. In any case, it is always possible to resolve the vi- 
brations of an atom along three axes. Evidently, the atoms of a crys- 
tal will have 3N degrees of freedom, where N is the number of atoms. 

If the molecules of acrystal are clearly distinguishable, it is mean- 
ingful to speak of vibrations of a molecule and vibrations of atoms 
within a molecule. Since the coupling between molecules is consid- 
erably weaker than between atoms, the frequencies of their vibra- 
tions will be less. In molecular crystals, the motion of a molecule as 
a whole is of decisive importance. The vibrations of a molecule about 
its equilibrium position are of a translational as well as torsional 
nature. Apparently, it is even possible in rare cases for total rotation 
of molecules about centre of a gravity to occur. For example, such 
rotation of molecules probably occurs in the case of solid 
methane (CH,). , 

The total energy of a vibrating particle consists of its potential 
energy and its kinetic energy. The average values of these two ener- 
gies over a period of vibration are equal to each other. As is known, 


the average kinetic energy of an atom in a gas is equal to $ kT. 
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It would seem reasonable to assume that, at the same temperature, 
a vibrating atom possessing twice the average energy possesses 3 kT 
units of thermal energy. Then, one mole of crystalline substance 
should have an energy 3RT and a molar heat capacity C,=3 R= 
=6 cal/mole. 





Fig. 266 


At high temperatures, this formula is in very close agreement with 
experimental results. The temperature dependence of the heat ca- 
pacity of crystalline bodies is shown in Fig. 266. Beginning at zero, 
the heat capacity increases, attains a value of 6 cal/mole at a certain 
temperature, and then remains unchanged. The ratio of tempera- 
ture to a constant 4 (to be discussed in the following article) is plot- 
ted along the x-axis. 

At high temperatures, the value of &T is significantly greater than 
the difference between vibrational energy levels; hence, the quantum 
nature of distribution of vibrating atoms according to energy levels 
does not affect the value of the average vibrational energy. Under 
such conditions, the simplified method of calculating the average 
energy is quite justified. This is confirmed by exact calculations 
which take into account the distribution of atoms according to ener- 
gy as given by the Boltzmann law. 

When kT becomes comparable to the difference between energy 
levels, the Boltzmann law is no longer applicable and must be re- 
placed by a quantum distribution law (see p. 678). Calculations indi- 
cate that heat capacity decreases with decreasing temperature. We 
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shall not present these calculations; we note, however, that quali- 
tatively a decrease in ¢, with decreasing temperature is quite under- 
standable. The smaller the magnitude of k7, the smaller the number 
of energy transitions that may occur in a system. This means that 
the possibility of thermal exchange decreases with decreasing kT 
and approaches zero as a limit. The limiting case can be explained as 
follows: Energy cannot be transmitted to a body in extremely small 
bursts kT even if there are an infinite number of such “bursts” with a 
very high total energy. Energy cannot be transmitted since a single 
“burst” does not suffice to transfer a system from its zero-energy lev- 
el to the next one. 

Experiments indicate that energy transitions corresponding to 
a change in the state of molecular motion in a crystal lie in the region 
of long infrared waves. 

For purposes of guidance, let us assume that such transitions cor- 
respond to a wavelength of 1 mm. 

Let us compare several values of RT with the quantum energy cor- 
responding to a wavelength of 1 mm. For this wavelength, v= 
=3%x10" sec—*, i.e., Av=20010-"? erg, 


RT (ergs) 
Temperature (°K) (Approx. values) 
500 7,000x 107” 
100 1,380x 107" 
10 138x 10737 
1 114x107” 


It can be seen that at 100°K the thermal vibration energy still 
considerably exceeds the difference between the energy levels of 
a molecule in a crystal. At 10°K it has the same order of magnitude 
and at 1°K thermal vibrations are unable to produce transitions from 
one level to another. 


235. Thermal Waves 


An interesting feature of thermal vibrations in a crystal is their 
occurrence in the form of thermal waves. Therefore, atomic vibra- 
tions cannot occur independently of one another. An atom deviating 
from its equilibrium position pulls along the next one. 

Since a crystal is a finite body, standing waves are formed within 
it. As in the case of all natural oscillations, the maximum length of 
a standing wave equals twice the dimension of the body. The boun- 
daries of a crystal must correspond to standing-wave nodes. 

On p. 134 we discussed elastic vibrations of solids considered as a 
continuum. It was shown that there arise in a finite solid body nu- 
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merous standing waves of different direction and frequency. The 
picture becomes considerably more complex when the atomic struc- 
ture of the solid is taken into account. A theoretical investigation of 
the possible vibrational motion of atoms in a monocrystal indicates 
that thermal motion in a crystal can be represented as the result 
of the superposition of 3sV waves, where N is the number of cells 
and s the number of atoms per cell. The number of possible waves 
is equal to the number of degrees of freedom of the system of atoms 
forming the crystal. How do these 
waves arise and how are they man- 
ifested> 

Let us restrict ourselves to a 
consideration of a chain of atoms, 
i.e.,a “unidimensional crystal”. Fig. 
267 shows such an atomic chain, 
the “cells” of which consist of two 
atoms, denoted by black and white 
dots. Since the actual thermal mo- 
tion of atoms in a crystal is of a 
very complex nature, the figure has 
been simplified to show the “ele- 
mentary” waves into which this Fig. 267 
motion can be resolved. Calculations 
indicate that the resulting vibration can always be ftepresented as 
the sum of harmonic vibrations. Like in the case of a solid rod, a 
series of waves of various wavelengths arise in a unidimensional crys- 


‘ tal. If a chain consists of a thousand cells with period a, there arise 
N waves of wavelength 2,000 a, 1,000 a, “a, 500 a, 400 a, ete. 
The shortest wavelength is 2a. 

But this is not all. Each of the possible wavelengths occurs in s 
variations. Two types of waves of the same wavelength are shown in 
the figure. A case exists in which the lattice of atoms vibratesas a 
whole. Such a wave is called an acoustical wave. The remaining 
s—l waves are quite different. In these cases, different types of atoms 
execute complex motion with respect to one another and at each 
instant only atoms of a single type fall on the sinusoid. There are 
s—1 such vibrations, which are called optical vibrations. 

The figure shows waves corresponding to atomic vibrations ina 
single direction. 

Atomic vibrations can always be resolved into two transverse 
components and one longitudinal component. Therefore, a wave of 
given wavelength travelling in a given direction will have 3 compo- 
nents of the acoustical type and 3 (s—1) of the optical type. Of the 
total of 3sN waves, 3V—two transverse and one longitudinal for 
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each wavelength in each direction—will be acoustical. This is com- 
pletely valid for three-dimensional crystals. 

Although the wavelengths and frequencies of the waves have dis- 
crete values, we may utilise the results obtained on pp. 135-36 and 
express, approximately, the number of acoustical vibrations having 
frequencies less than v as 


Here, vu is the volume of the crystal and c the velocity of the wave. 
The velocities of the longitudinal and transverse waves are differ- 
ent. Therefore, the total number of waves, which is equal to 3N, 
should be written as follows: 
ine Caen Wigs ON, 
C. 


3 
Cy t 


where c, is the longitudinal velocity and c, the transverse velocity 
of the wave. Whence, the value of the maximum frequency of vi- 








bration, v,,,,,can be easily found. The corresponding wavelength, 
Mie = eet 
Ee ie Mina 


is, as it should be, of the same order of magnitude as a cell period. 

If the velocity of propagation of acoustical waves in a crystal is 
known, we can calculate v,,,,, the value of which determines to a large 
extent the behaviour of a crystal. 

As was indicated above, heat capacity depends on the commensur- 
ability of the energy Av and the thermal energy kT. If hv,,,.<kT, 
thermal exchange excites all of the vibrations and waves in a crys- 
tal, i.e., all of the quantum transitions are possible. As a result, 
the quantum nature of thermal exchange is not apparent. Such a 


crystal has a characteristic temperature § = 22x which is much 


less than the temperature of the experiment, i.e., <T. On the oth- 
er hand, if fv,,q,5>kT, i.e., if the characteristic temperature §S>7, 
only vibrations of low frequency are excited in the crystal because 
high energy levels cannot be surmounted by the thermal “bursts”. 

Here are examples of the characteristic temperatures (°K) of a 
number of crystals: 


Pb Benzene Ag NaCl Fe Be Diamond 
90 150 215 280 450 1,000 1,860 
For such substances as lead and benzene, room temperature is “high”. 


This corresponds to the horizontal portion of the heat capacity curve 
(c,=6 cal; see Fig. 266). On the other hand, room temperature is 
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low for beryllium and diamonds. Thermal vibrations of these sub- 
stances are excited to an insignificant extent and their heat capacity 
is considerably less than 6 cal/mole. The maximum vibration fre- 


quencies Vmax = can be calculated from the above values of the 
characteristic temperature 9: 


Pb Benzene Ag NaCl Fe Be Diamond 
1.88x10"% 3.13x10 4.47x10% 5.84x10"% 93x10" 20.8x10" 38.810" 


It can be seen that the limiting frequencies of thermal vibrations lie, 
as was assumed in the example on p. 602, at the boundary between 
the infrared and radio bands (A,,;,>10~? cm). 


236. Thermal Expansion 


How can we explain the fact that the average distance between neigh- 
bouring atoms increases with increasing temperature? To answer 
this question, let us consider the curve of potential energy of inter- 
action between atoms or mole- y 
cules (see Fig. 268). Irrespective 
of the peculiarities of the inter- 
action between particles, the 
potential curve is always asym- 
metrical: in the direction of de- 
creasing separation between par- 
ticles the curve rises steeply, 
while in the direction of increas- 
ing separation a well wall is 
formed. This is due to the follow- 
ing simple fact: practically, the 
separation between two atoms or 
molecules cannot be decreased 
indefinitely, but it can be in- 
creased indefinitely—at great dis- 
tances the bond between the 
particles is broken. 

The maximum and minimum 
distance between vibrating atoms 
may be noted on a potential Fig. 268 
curve. The middle of the segment 
connecting the two limits corresponds to the average position of an 
atom. When the temperature increases from 7, to T, the energy of 
a vibrating particle increases and the particle passes over to another 
energy level (see Fig. 268). Since the potential curve is asymmetrical, 
the average position of an atom is displaced to the right. Therefore, 
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the average separation between atoms will be greater than the equi- 
librium separation between atoms at rest, i.e., the r corresponding 
to the minimum of the potential well. Thermal expansion results 
from the fact that the average separation between atoms increases 
with temperature. 

The thermal expansion of a crystal is anisotropic, i.e., in differ- 
ent directions the coefficient of linear expansion a has different 
values. Therefore, when in- 
dicating the value of a, 
the crystallographic direc- 
tion of interest should be 
specified. 

When it is required to 
give a complete description 
of the thermal expansion 
of a crystal, an expansion 
diagram may be used. Such 
a diagram is shown for a 

Fig. 269 naphthalene crystal in Fig. 

269; a, b, c are the crystal 

axes and A,, Ay, Ay, the axes of symmetry of the expansion dia- 

gram. The length of a radius vector drawn from the origin to a point 

on the surface of the diagram gives the value of @ in the given di- 
rection. 

The shape of an expansion diagram and its orientation relative 
to the cell axes accord with the symmetry of thecrystal. It cannot be 
otherwise since physical properties must be the same in directions 
connected by symmetry operations. 





In order to determine linear expansion coefficients, we must have at our dis- 
posal means to measure very small displacements with a high degree of accura- 
cy. Instruments for measuring thermal expansion are called dilatometers. In- 
terference methods (see p. 355) can provide the required sensitivity (several 
hundredths of a micron or better), but these require perfectly ground specimens, 
which are very difficult to prepare. 

In practice, quartz differential dilatometers are used. In such instruments, 
the specimen to be measured is placed in a cylindrical holder made of quartz 
glass. At the bottom of the holder a base prism is placed. One end of the speci- 
men rests on this prism. A quartz rod, which transmits the expansion of the spec- 
imen to the measuring portion of the instrument, rests on the upper end of the 
specimen. The displacement of the end of the rod is measured by means of a 
microscope or rotary mirror. Since the holder as well as the specimen expands 
upon heating, the instrument registers the difference between the coefficients 
of expansion of the specimen and the holder. The necessary corrections, based 
on available data on the thermal expansion of quartz over a wide range of tem- 
peratures, may then be introduced. 

The most accurate method of obtaining an expansion diagram is by means 
of X-ray structural analysis—measurement of the displacements of diffraction 


spots. 
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Coefficients of Linear Expansion 





| t°C ax10', deg™? 
Aluminium ....... 0-100 0.238 
Gypsum ......48. 12-25 0.025 
Quartz, || to the axis. . 40 0.0781 
Quartz, | to the axis . 40 0.1419 
TCO rs) aimed 8 POR cel HE —10-0 0.507 


237. Crystal Imperfections 


Block Structure. The structure of an actual crystal differs consid- 
erably from that of an ideal space lattice. This assertion is based 
on numerous facts, including direct electron-microscopic observation;. 
However, our basic knowledge of inner crystal imperfections is de- 
rived, in the first place, from strength measurements. A crystal will 
rupture when it is subjected to a stress of less than a hundredth 
of the stress which an ideal object should withstand. Deformations 
and the rupture of crystals will be discussed in Chapter XXXIV. 
Here, we shall summarise our present knowledge of crystal imper- 
fections. 

Briefly, the situation can be described as follows: A monocrystal 
does not constitute a single lattice. It consists of a large number of 
tiny blocks which are slightly 
misaligned (within the limits 
of several seconds or minutes) 
with respect to one another. 
The dimensions of the blocks 
may vary within rather broad 
limits. In most cases, they lie |! 
in the range of 10~°-10-* cm. 
Plotting of the block dimen- 
sions of acrystal would prob- 
ably yield a characteristic 
distribution curve. 

Of great interest is the ar- 
rangement of particles at the 
boundary between two blocks. Fig. 270 
There is good reason to as- 
sume that a liquid surface covered with soap bubbles can serve as 
an excellent model. 

An examination of Fig. 270 shows that a “fracture” exists in the atom- 
ic rows close to the centre of the model. The portion of the “structure” 





608 Atomic Structure of Bodies (Chap. XXXII 


illustrated in the figure can be represented as four blocks with 
a common corner at the centre of the model. An imperfection is clear- 
ly visible at the centre. Here, the upper “atoms” have not fallen into 
their right places, i.e., they do not fit into the empty spaces of the 
compact packing structure. This fault has led to the splitting up of 
the crystal into blocks. ia 

Many such faults—called dislocations—exist in a crystal. They 
are distributed randomly and may turn an atomic row to the lelt 
or to the right. Therefore, on the average, all crystallographic align- 
ments extend through an entire monocrystal with great exactitude. 
Dislocations give a monocrystal a block (or mosaic) structure. To be 
sure, the presence of chance microfissures, or empty spaces, several 
atoms deep, also facilitate the formation of block structures. 

An examination of the figure shows that a dislocation may also 
be pictured as follows: two adjacent rows, one of which has one par- 
ticle more than the other—an extra atom has “found its way” into 
one of the rows. 

Dislocations. Fig. 270 represents a two-dimensional model of a 
crystal. It is as if each row were the projection of an atomic layer 
which is oriented perpendicular to the 
figure. The large fault would corre- 
spond in a three-dimensional crystal 
to a linear region perpendicular to the 
figure. This region may be called the 
core of the dislocation. 

Dislocation patterns not only ex- 
plain the block structure of crystals, 
but also many other phenomena. There- 
fore, it is well to study these pecu- 
liar distortions of crystals in detail. 

There are two kinds of disloca- 
tions—simple and spiral. The disloca- 
tion illustrated by the bubble model 
is of the simple kind. Schematically, 
such a dislocation is illustrated in 
Fig. 271la. The core of the dislocation 
is designated by an inverted T. The 
distortion is maximal near the dislo- 
cation plane dividing the crystal into 

Fig. 271 two parts and rapidly diminishes 

in either direction away from the 

dislocation line. Fig. 2715 shows a top view of the two 
adjacent atomic planes on either side of the boundary between the 
blocks. The upper, or compressed, plane (designated by solid lines) 
contains one row more than the lower one (designated by dotted lines). 
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Analogous diagrams for a so-called spiral dislocation is shown in 
Fig. 272. The lattice is divided into two blocks, one of which has 
slipped, so to speak, a distance of one period relative to the other 
(see Fig. 272a). At the axis shown in the figure, the distortion is max- 
imal. The region adjacent to this axis is called a region of spiral 
dislocation. It will be easier to grasp the essence of this distortion 
if we examine the diagram of the adjacent atomic planes on either 
side of the boundary between blocks (see Fig. 272b). This is the right 
view of the three-dimensional figure. The spiral dislocation axis is 
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Fig. 272 


the same as in the three-dimensional figure. Solid lines denote the 
plane of the right block and dotted lines those of the left. As may 
be seen from the diagram, a spiral dislocation differs from a simple 
dislocation. There is no extra row of atoms in this case. The distor- 
tion consists in the fact that the atomic rows change their closest 
neighbours near the dislocation axis, i.e., they bend and drop down 
one level. 

Why is this called a spiral dislocation? This can be explained as 
follows. Let us move around the dislocation axis—along the nodal 
planes of the lattice—beginning at the lowest plane. After each rev- 
olution, we are one level higher. In this manner, we reach tke top 
of the crystal, much the same as having climbed a spiral staircase. 
In Fig. 272, the spiral motion would be counterclockwise. If the 
blocks were displaced in the opposite direction, the spiral motion 
would be clockwise. 

In a given object, one may encounter successive spiral dislocations 
having the same rotation direction. If two dislocations having 
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different rotation directions are in the same plane, the resulting 


distortion is more complex. 


Imperfections within a Block. A crystal lattice may consist of 
blocks which also have imperfections. These imperfections may be 
in the form of lattice vacancies or foreign atoms. A very small number 


Fig. 273 





of lattice vacancies and foreign atoms 
can result in considerable distortion. 

Fig. 273 shows the nature of these 
distortions. In (a) we see the effect of 
a foreign atom that replaces one of 
the basic atoms of a lattice, in (6) the 
effect of a foreign atom penetrating 
between basic atoms, and in (c) the 
effect of a lattice “vacancy”. The dis- 
turbance may extend to a depth of 
5-10 lattice spacings in each direc- 
tion. This encompasses a region of the 
order of 1,000 cells. Therefore, an 
impurity of the order of 0.1 per cent 
may fundamentally change the prop- 
erties of a crystalline substance. (It 
should be noted, however, that an 
impurity does not produce appreciable 
lattice distorion.) In Sec. 272 we shall 
deal with the case of semiconductors 
in which impurities of the order of 
one part in a thousand million may 
change the electrical properties of a 
body. 


238. Short-Range Order. Liquids 


It was seen at the beginning of this 
chapter that very many solids can be 
represented as compact packings of 
spheres. In such structures, the frac- 
tion of the total volume consisting 
of empty space is equal to 26 per cent. 

Copper is an example of such a crys- 
tal. How does the structure of a piece 
of copper change when it is melted? 
Experiments show that the volume 
increases by about 3 per cent. This 
increase is due to an increase in emp- 
ty space, which now equals 29 per 
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cent of the total volume instead of 26 per cent. The compact struc- 
ture has loosened and the spheres are able to move away from their 
“proper” positions. The ideal order which is characteristic of a crys- 
tal has been disturbed. : 

As a result of thermal motion, the spheres vibrate, in general, 
about their equilibrium positions and remain surrounded by the 
same neighbours. Now and then, neighbours may change when a space 
of the same size as the volume of a sphere happens to be created 
in the vicinity of the sphere. Owing to the closeness of particles in 
a liquid, a so-called short-range order arises. In a model of spheres, 
one sphere cannot approach another closer than the diameter of a 
sphere. Such a deviation from ideal randomness occurs in gases as 
well, but it is of little importance there since theclosest molecules in 
a gas are separated, on the average, by a distance ten times as great 
as the dimensions of a molecule. 

Let us examine a molecule in a liquid and imagine two concen- 
tric spheres about it. Assume the radius of one is equal to that of the 
molecule and the radius of the other to three times this value. On the 
average, how many close neighbours does such a molecule have? By 
close neighbours we mean molecules located in the region between 
concentric spheres. Consider the example of copper, the volume of 
which increases by 3 per cent when it is smelted. According to cal- 
culations there are, on the average, 11.6 atoms in the region under 
consideration. Thus, there are to be found about 12 close neighbours 
the centres of which are separated from the given atom by a dis- 
tance equal to its diameter. Closer neighbours are not to be found. 

It is clear that the short-range order 
affects not only close neighbours but suc- U(r) 
cessive ones as well. Therefore, it is ? 
customary to describe the short-range or- 
der by the average density of radial / 
distribution of atoms. : 

Let us imagine two concentric spheres 9°—3~Y—G_ 3 OO KA 
of radius r and r+dr about an atom. For 
simplicity, assume we are dealing with Fig. 274 
a monatomic liquid. The volume of this ; : 
spherical shell will be 4ur’dr. The number of atoms falling within 
this shell may be expressed as 


U (r) X 4ar7dr, 


where U(r) is the density of radial distribution of atoms. ; 

A U(r) curve for amorphous arsenic is shown in Fig. 274. Maxima 
of the curve indicate that certain interatomic spacings acquire con- 
siderably more “weight” than others. The origin of successive maxima 
is exactly the same as of the first. The density of packing is such as 
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to allow the number of closest neighbours of a given atom to fluc- 
tuate only within very narrow limits, while the number of neigh- 
bours closest to the closest may fluctuate within somewhat broader 
limits. As the distance from the central sphere increases the random 
order becomes more and more evident. The U(r) curve approaches a 
limit, i.e., the short-range order fades away and gradually passes 
over into arandom order. It is convenient to set the value of U(r) 
equal to unity at r+ oo. This distinctive order with regard to close 
neighbours, which fades away as the distance from the atom or 
molecule under consideration increases, is what is meant by a short- 
range order. 

The order in the arrangement of particles characteristic only of 
crystals is called a long-range order. This means that the three-di- 
mensional periodicity peculiar to a crystal does not fade away at 
great distances. The arrangement of atoms along a nodal line regu- 
larly repeats itself thousands and millions of times. 

When we discussed the structureof crystals, we saw that atoms by 
no means always behave like spheres. This applies to liquid struc- 
tures as well. , 

In an ideal case, the short-range order in an atomic liquid should 
result in the number of close neighbours being equal to almost twelve. 
Experiments show that metals the crystal structures of which 
consist of compact arrangements of spheres continue to have such a 
short-range order, i.e., an average number of close neighbours just 
short of twelve, after being melted. ; 

As indicated above, every atom of lithium, sodium and potassium 
in a crystal has eight close neighbours. The same short-range order 
is preserved in a liquid, but the average number of close atoms be- 
comes somewhat greater than eight. 

Simple substances in the crystalline state of which the atoms are 
firmly bound to a small number of neighbours behave differently. 
These bonds are broken when such substances are melted and the 
number of closeneighbours per atom of fluid becomes greater than in 
a crystal of the same substance. 


239. Amorphous Bodies 


The word “amorphous” means “without form”. Amorphous solids 
are the antitheses of regular polyhedral crystals. However, the shape 
of a polycrystalline body is not regular even though it is not amor- 
phous. How then may crystals and crystalline bodies be recognised? 
They may be recognised, primarily, by their well-defined melting 
points. If heat is applied to a crystalline body, the temperature of 
the body increases until it begins to melt. Thereupon, the tem- 
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perature ceases to rise and the entire melting process takes place at 
the melting temperature. 

Ordinary glass is a typical amorphous solid. It grows soft when 
it is heated and gradually goes over into the liquid state as the tem- 
perature is raised. 

This behaviour of amorphous bodies can be explained by their 
structural peculiarities, leading to the classification of such bodies 
as liquids rather than solids. 

As indicated, there exists a long-range order of particles in crystal- 
line bodies. In amorphous bodies, only a short-range order of particles 
occurs and in this respect such bodies do not differ from liquids. 





° Si 
Fig. 275 


Fig. 275a shows the structure of quartz (silicon dioxide), and 
Fig. 275 the structure of quartz glass. From the chemical viewpoint, 
a substance can be obtained in a crystalline form as well as in an 
amorphous form. The similarities and differences between these two 
states can be clearly seen in the figure. Apparently, an amorphous body 
is a “damaged” crystal. In a crystal and an amorphous body, the 
number of close neighbours and the nature of the encirclement are the 
same. Possibly, a pentagonal ring is particularly advantageous ener- 
getically for SiO, groups. Since the symmetry of an axis of fifth 
order cannot produce a periodic structure (see p. 687), amorphous 
glass is obtained. 

The absence of a long-range order, which is characteristic of crys- 
talline bodies, is the immediate cause for the absence of a well-de’ 
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fined melting point. At the melting point, a transition occurs and 
the long-range order disappears. Only a short-range order in the 
arrangement of atoms remains. 

In amorphous bodies, the nature of the arrangement of atoms does 
not change when the temperature is increased. Only the mobility 
of the atoms changes, i.e., the atomic vibrations increase. At first 
only a few atoms are able to escape from their encirclement and 
change neighbours. This number gradually increases until, finally, 
the rate of such changes becomes the same as in water. 

The ease with which a given molecule may change its neighbours 
is related to an important property of a liquid, namely, its viscosity. 
The less frequently neighbours are changed in a liquid, the thicker, 
i.e., the more viscous, the liquid. Of course, an increase in tempera- 
ture, which increases the swing of molecular vibrations, results in a 
decrease in viscosity. It is also quite understandable that, tempera- 
ture conditions being equal, the liquid whose molecules are more com- 
plex will be more viscous. Many liquids harden before they become 
very viscous. The high viscosity of glue, honey, tar and oil is due to 
the complex form of their molecules. 

When a liquid hardens, the exchange of molecules practically ceases. 


240. Short- and Long-Range Order of Atoms in Alloys 


When two or more substances crystallise together, they may in 
certain cases form acommon crystal lattice. It depends on the relative 
values of the energy of interaction between homogeneous and heter- 
ogeneous particles whether such a mixed crystal is formed or not. 
If the attraction between homogeneous particles is greater than be- 
tween heterogeneous particles, a mixed crystal is not formed. 

Metal alloys, which are widely used in various industries, are 
mixed crystals. By reference to the structure of alloys, we can 
clarify the concepts of short- and long-range order. 

In the simple case of diatomic alloys, we may encounter perfectly 
ordered structures in which a definite cell can be distinguished and 
the substance described as a crystal of a compound with the definite 
formula A,B,,. However, this does not always occur and in a number 
of cases A atoms randomly replace B atoms in their lattice or, if 
they are small, randomly become lodged between B atoms. 

We shall discuss only a substitution alloy, namely, iron-cobalt 
(see Fig. 276). This alloy has a simple body-centred lattice struc- 
ture. Each atom—iron as well as cobalt—has eight close neighbours. 
As regards the arrangement of atom centres, an alloy crystal is al- 
ways perfectly ordered, i.e., the atom centres form the same body- 
centred lattice under all conditions. The situation differs with re- 
spect to the distribution of iron and cobalt atoms. Let us consider the 
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Fig. 276 


cell points of a crystal to be divided into corners and centres of cubes. 
For perfect order, all corners are occupied, say, by iron atoms and all 
centres by cobalt atoms (Fig. 276a). The ideal long-range order of 
such a crystal may gradually deteriorate if atoms begin to occupy 
“foreign sites”. But as long as the number of atoms located at their 
“own” sites differs from the number of atoms located at “foreign” 
sites (Fig. 2765), the crystal may be said to have a long-range order, 
even though the order is, in part, “impaired”. The long-range order 
disappears when the distinction between “foreign” and “own” sites 
is lost, i.e., when half the atoms are at their “own” sites and half 
at “foreign” sites (Fig. 276c). 

It is important to note that when a completely ordered crystal 
is heated, the order is gradually destroyed, i.e., the percentage of 
atoms at “foreign” sites increases. There exists a temperature above 
which even a partially “impaired” long-range order cannot exist. 
This temperature is called the A-point (lambda-point). For an iron- 
cobalt alloy, the A-point is 770°. The transition from order to dis- 
order signifies that thermal motion has gained the upper hand over 
the “tendency” of atoms to maintain a long-range order. 

There is a great deal in common between the process of eliminat- 
ing the distinction between “foreign” and “own” sites and the melt- 
ing process. Both processes result in the disappearance of a long- 
range order. However, melting results in the disappearance of the long- 
range order of atom centres, while passing through the A-point re- 
sults in the disappearance only of the order in the arrangement of 
atoms of different elements. ™ 

The basic characteristic of the structure of alloys of the iron-co- 
balt type is the possible existence of a partial long-range order. 
Such a partial long-range order can exist only with respect to the 
distribution of the iron or cobalt atoms, but not with respect to the 
arrangement of atom centres. 
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Like in the case of melting, the elimination of a long-range order 
does not mean the elimination of order in general (a short-range or- 
der remains). 

The short-range order with respect to the distribution of atoms in 
iron-cobalt crystals consists in the “tendency” of cobalt atoms to 
surround themselves with iron atoms, and vice versa. If we take any 
atom and examine its eight close neighbours, we will find that the 
number of atoms of the other element will not be equal to one half 
of the total, i.e., four. Depending on the degree of perfection of 
the short-range order, an iron atom may be surrounded, on the aver- 
age, by five, six or seven cobalt atoms. 

The investigation of a copper-gold alloy shows that its short-range 
order has a high degree of perfection. This pertains not only to the 
number of closest neighbours, but to the number of closest to the 
closest, etc. If a number of spheres are drawn about any of the gold 
atoms, it is found that the first shell contains, in effect, only copper 
atoms, while the second contains only gold atoms. In successive shells, 
the short-range order begins to deteriorate, but a predilection for 
oe of a definite element will be felt as far away as the tenth 
shell! 

It has been determined by means ‘of very precise investigations 
using X-rays how a long-range order in alloy crystals is “created”. 
Experiments with cobalt-platinum alloys have shown that the re- 
gions of long-range order grow in a disordered crystal as crystal nu- 
clei grow in a liquid. These embryonic regions are arranged in a per- 
fectly definite manner relative to the axes of a crystal. 


241. Liquid Crystals 


To find examples of liquid crystals, one must turn to organic sub- 
stances. Molecules of substances forming liquid crystals are always 
elongated. Liquid crystals are encountered among viruses, and also 
among lipoids, which are components of living tissue. 

A substance forming liquid crystals exists as such in a definite 
temperature range. If a liquid crystal is heated, it turns into an 
ordinary liquid; if it is cooled, it becomes a crystal. 

The term “liquid crystal” is derived from the strange manner in 
which the properties of a liquid and a crystal are combined. A liq- 
uid crystal possesses fluidity and forms drops. However, these drops 
may be elongated rather than spherical, and somewhat resemble jelly. 
Careful investigation shows that the order of molecules in such a drop 
is not like in ordinary liquids. 

Two kinds of liquid crystals are known. In one, the molecules are 
in a short-range order and parallel to one another. In the other, the 
order of molecules is even more peculiar. Here, the molecules are 


Sec. 2417” ; Liquid Crystals 617 





arranged in ayers Each layer consists of parallel molecules, which 
are in a short-range order. This is illustrated in Fig. 277. 
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A soap solution consists of liquid crystals. The washing properties 
of soap are directly related to its ability to form liquid crystals. A 
molecule of soap has an elongated shape 
(transverse axis measuring about 4A and 
longitudinal axis 30-40A). At one end of if 
a molecule a negative electric charge is 
concentrated and this pole is attracted by I 
water molecules. A soap solution is a liq- i 
uid crystal. It consists of alargenumber _ 
of double layers of molecules, separated by = ==—=—=7=——"= === 
layers of water (see Fig. 278). In the dou- === == 
ble layers, the poles of the molecules are 9 =7=—=>—= 
turned outward, i.e., toward the water. 

The molecules of soap within a layer are 7 i ‘ ‘ 7 

in a close arrangement and in a short-range 

order. If there is little soap in the water, 

the double layers of soap molecules are ! l 
separated by large layers of water. As more 

soap is put into the water, more and more 7T>=>= ==> === 


double layers will be produced. The solu- 2=2=====~= wip See 


tion becomes saturated when the thickness 22———~~====-— 
of a water layer equals about 20 A. The qth te ee 
double layers forming a liquid crystal Fig. 278 

possess great mobility. When we wash our 

hands, the layers slide easily relative to one another and the skin. 
Dirt from our hands collects at the poles of the molecules and is then 
released in the water. 
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242. Polymers 


A large number of organic substances, composed of giant molecules 
consisting of thousands of atoms, have a peculiar structure. Such 
substances include plastics, kapron and artificial silk. The molecules 
of these substances consist of identical groups of atoms arranged in 
a chain (whence the term “poly- 
mer”). The atoms within a mole- 
cule are frequently in a_ long- 
range order. 

The properties of high-poly- 
mers having lateral chemical 
bonds between chains differ sig- 
nificantly from those of so-called 
linear polymers, in which such 
bonds do not exist. Polymers 
having lateral chemical bonds 
between chains are rigid systems. 
Their atoms are arranged rather 
loosely and in a completely 
haphazard manner. Plastics of 
such polymers are used in the 
manufacture of buttons, kitchen 
6) utensils and various fittings. Li- 

near polymers have interesting 
properties and structures. Al- 
WOK though a number of points still 
an remain unclear, the basic struc- 
as tural features of these substances 
Se 93 are well established. 
<— In the solidification of a melt, 
or directly in the chemical proc- 
ess, long molecular chains be- 
Fig. 279 comearranged in parallel streams. 
Since solidification begins 
simultaneously from a large number of centres, numerous stacks of 
chains, which collide during growth and pass round one another, are 
formed. As a result, the final shapes are rather odd and intricate. 

The role of a stack in a linear polymer is somewhat analogous to 
the role of a crystalline particle in a polycrystalline substance. Nev- 
ertheless, there is a significant difference between the two, for the 
degree of order of a group of parallel chains forming a stack (consisting 
of thousands or tens of thousands of such chains) may vary consider- 
ably from case to case. Fig. 279 shows three kinds of order: a) crys- 
tal type—the axes of the chains form a perfect lattice and the 
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azimuths of the chains are ordered; 6) gas-crystal type—the axes of 
the chains form a lattice and the azimuths are disordered; and c) amor- 
phous or liquid type—no lattice is formed (absence of long-range 
order). It should be noted that the displacements of the chains relative 
to one another in the longitudinal direction also may be ordered or dis- 
ordered. Since a stack of chains, each of which consists of hundreds of 
thousands or millions of atoms, is very long and will constitute, 
therefore, a broken formation, it is evident that even in the case of 
ideal order in the arrangement of chains the order of a stack is not 
entirely like that of a crystal. Only the portion of a stack for which 
the parallel chains are rectilinear can have a crystal arrangement. 
This means that in the case of ideal order each stack of chains con- 
sists of a sequence of crystalline regions (crystalline particles). 

The stretching of linear polymers consists in the unfolding of 
stacks of chains. A similar mechanism of elongation enables us to 
explain extensions of up to 1,000 per cent occurring in certain nat- 
ural and artificial high-polymers. 

Rubber and polyethylene are the best known linear polymers. 
The high polymers used in the manufacture of artificial fibre also 
have the same kind of structure. 


CHAPTER XXXII 
PHASE TRANSFORMATIONS 


243. Phase Diagrams 


A substance can have only one gas state and one liquid state, but 
it can have several crystal states (also, several liquid-crystal and 
gas-crystal states). 

The gas and liquid states of a substance are characterised by disor- 
der in the arrangement of particles. In a gas, the ratio of the kinetic 
energy of the particles to their potential energy of interaction is such 
that the binding forces cannot restrain the particles from flying apart, 
to the extent to which the vessel containing the gas permits. The 
liquid state has a definite form since the binding forces do not permit 
the molecules to have independent free paths.* At high pressures, 
the distinction between a gas and a liquid disappears. 

Since two basically different random arrangements of particles 
do not exist, every substance has one liquid and one gas state. A crys- 
tal is characterised by a definite arrangement of particles and, in 
principle, an infinite number of different crystal phases can exist 
for a given substance. In actuality, several different crystal phases 
exist, as a rule, for one and the same chemical compound (diamond 
and graphite, white and gray tin, yellow and red sulphur, etc.). 

Every substance exists in one or another phase, depending on the 
external conditions, viz., the temperature and the pressure. It is 
customary to use a pressure-temperature diagram, instead of a 
table, to describe the conditions for the existence of the various 
phases of a given substance. A diagram of this kind is known as a 
phase diagram. . 

Three such diagrams areshown in Fig. 280. In the upper left-hand 
corner, we see a phase diagram for an ideal substance having only 
one solid phase. The diagram is divided into three regions: one re- 
gion indicates the conditions for the existence of a crystal, another 
for the existence of a liquid, and the third for the existence of a gas. 
The gas state, of course, is represented by the lower right-hand por- 
tion of the diagram, i.e., where the temperatures are low and the pres- 
sures high. The solid phase is represented by the region of lowest 
temperatures and highest pressures. Such a diagram is very convenient. 
In order to determine the state of a body at a pressure p and a temper- 
ature T, all we need to do is find this point on the diagram and 
observe in which region it is located. 


* In the absence of gravitational forces, a drop of liquid is spherical. 
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In the upper right-hand corner of Fig. 280, the conditions for the 
existence of the various phases of sulphur are shown. This substance 
has two crystal phases and therefore the diagram is divided into 
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four parts. In the lower part of Fig. 280, the phase diagram for water is 
shown. Since it is difficult to represent such a diagram drawn to linear 
scale on a single drawing, the pressure in this diagram has been plot- 
ted logarithmically. It will be seen that ice exists in five different 
phases, which are designated by the Roman numerals I, II, III, V 
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and VI. The phase which was originally designated as IV turned out 
to be an error. The less common phases of ice exist at higher pres- 
sures, 

On a phase diagram, dividing lines between phases, as well as 
points within the various regions of the diagram, have physical sig- 
nificance. At pressures and temperatures corresponding to points on 
the dividing lines, two boundary phases exist simultaneously. In 
the case of water, this would correspond to the condition of ice 
floating on water, with the icenot melting and the water not freezing. 
The dividing lines may be called phase equilibrium curves. 

It should be emphasised that these are phase equilibrium curves 
and not points. This means that equilibrium between two phases 
can be realised at different temperatures if the pressure is varied 
accordingly. This may also be expressed as follows: the temperature 
of phase equilibrium is a function of pressure; or, the pressure of 
phase equilibrium is a function of temperature. 


244. Phase Transformations 


Phase equilibriumcurves may also be called phase transformation 
curves since transition from one phase to another occurs when this 
line is crossed. 

The dividing line between a solid and a liquid is the fusion, or 
crystallisation, curve; and the dividing line between a liquid and a 
vapour is the vaporisation, or condensation, curve. We call the di- 
viding line between a solid and a vapour the sublimation curve and 
the lines between two solid phases simply transformation curves. 

Processes involving a change of state are also conveniently indi- 
cated on a phase diagram. Usually, we are concerned with transfor- 
mations occuring at constant temperature or at constant pressure. 
These processes are represented on a diagram by vertical and hori- 
zontal lines, respectively. 

Several phase transformations are illustrated in Fig. 280. Line 
2-1 on the phase diagram of sulphur represents a cooling process for 
a sulphur gas at normal pressure. At a temperature of 444.5°C sul- 
phur is transformed froma gas into a liquid, at 110.2°C from a liq- 
uid into a crystal phase and, finally, at 95.5°C from this crystal 
phase into another crystal phase. The compression of sulphur gas is 
illustrated in the same diagram by the process 3-4. By increasing the 
pressure, we are able in this case too to transform a gas into a liquid 
and then, at very high pressures (above point 4) into solid states. 

Under certain unique conditions, three phases may exist together 
simultaneously. Such points are called triple points. Sulphur has 
three triple points: 1) gaseous, liquid and yellow sulphur existing 
simultaneously; 2) gaseous, liquid and red sulphur existing simul- 
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taneously; and 3) a liquid existing in equilibrium with the two 
crystal phases. . 

It can be proved on the basis of strictly thermodynamic principles 
that a quadruple point cannot occur. Thus, no conditions exist un- 
der which, for example, two crystal phases are in equilibrium with 
a liquid and a vapour. 

Every phase transformation is characterised by a transition tem- 
perature at a given pressure. We speak of the melting (crystallisa- 
tion), boiling, sublimation, etc., points of a substance. If the pres- 
sureis not indicated, this usually means that the transformation occurs 
at normal atmospheric pressure. 

An important characteristic of a transformation is its heat of tran- 
sition. The occurrence of latent heat of vaporisation and heat of fu- 
sion is well known, but heat of transition is a general phenomenon. 
A transformation which proceeds by heating absorbs heat. In accord- 
ance with the second law of thermodynamics, heat of transformation 
is uniquely related to a change in entropy: 


AQ=TAS, 


where 7 is the transition temperature. Therefore, it is evident that 
a phase transformation which proceeds by heating is accompanied 
by an increase in entropy. 

The transition (fusion, boiling) temperature can be calculated from 
the formula. 

AQ 
T= AS? 
i.e., it is equal to the quotient of the latent heat of transition divid- 
ed by the increase in entropy. But in this form the statement is of 
purely theoretical significance since practically the change in entro- 
py for a phase transformation cannot be precalculated. However, 
knowing the transition temperature and heat of transition from exper- 
iments, one can determine accurately the magnitude of the increase 
in entropy. 

The phase transition from ice I to ice III occurs at a temperature ¢=—20°C 
and a pressure p=2,103 atmospheres. This transition is accompanied by the 
release of heat; each gram of ice releases AQ=5.6 cal. Therefore, the change 
in entropy is 


AS ="8 = 3-8 = 0,022 cal/deg. 


245. Phase Stability 


How do we explain the fact that under certain conditions a body 
constitutes a liquid and under others a solid? There are two tenden- 
cies which determine the nature of a state under given external con- 
ditions: first, the tendency of a body to possess a minimum of energy 
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and, secondly, the tendency to possess a maximum of entropy. The 
first tendency is a consequence of the fact that a system of molecules 
behaves like any system of mass points subject to the laws of New- 
tonian mechanics and, as we know, amechanical system tends to pos- 
sess a minimum of potential energy. The second tendency is a conse- 
quence of the second law of thermodynamics. 

During the transition from a gas to a liquid and during the transi- 
tion to a solid, the internal energy of a substance decreases. The ener- 
gy of a gas is higher than the energy of a liquid since in passing from 
a liquid to a gas work must be expended in overcoming the binding 
forces between molecules. And the energy of acrystal is lower than the 
energy of a liquid since an ordered arrangement of interacting parti- 
cles is always more stable than a disordered arrangement. This can 
be proved rigorously, but the proof will not be presented. The state- 
ment appears rather obvious. Imagine, for example, a perfect lat- 
tice of spheres connected by means of springs. Any displacement 
of a sphere requires a certain amount of work. Hence, an ordered ar- 
rangement corresponds to a minimum of energy. 

Entropy behaves in a different manner. Roughly speaking, the 
greater the freedom of motion of the constituent particles of a body, 
the greater its entropy. A disturbance in the order or an increase in 
the separation between particles results in an increase in entropy. 

Thus, for a given temperature and pressure, the state of a substance 
is established as a compromise between entropy and energy. Using 
the second law of thermodynamics, we can obtain a quantitative ex- 
pression for this general law. 

Imagine that a body is placed under “foreign” conditions, i.e., 
ice under the conditions for the existence of steam, etc. In such a 
case, an irreversible phase transformation (fusion, vaporisation, etc.) 
will occur in accordance with the second law of thermodynamics: 
the increase in the entropy of a body will be greater than the applied 
reduced heat, 


dQ 
Using the first law of thermodynamics, we can rewrite the inequali- 
ty in the form 


dS>UAP® and dU—TdS-+ pdo<0. 


Since the phase transformation occurs at constant temperature, we 
obtain 
d(U—TS) + pdv<0. 


If the process is not accompanied by a change in volume, 
the transition to a state of equilibrium takes place with d(U—TS)< 


Sec. 245] Phase Stability 625 





<0, i.e., with a decrease inthe quantity F=U—TS. This function 
is called the free energy. We have shown that a spontaneous phase 
transformation is accompanied by a decrease in free energy, i.e., 
the free energy of a stable state must be a minimum. 

« If the process occurs at constant pressure, the transition to an equi- 
librium phase takes place with d (U—TS-+pv)<0, i.e., with a de- 
crease in the quantity O=U—TS-+-pv. This function is called the 
thermodynamic potential. Thus, a phase transformation at constant 
pressure is accompanied by a decrease in thermodynamic poten- 
tial, i.e., the thermodynamic potential will have a minimum value 
at equilibrium. ; 

The opposing tendencies of entropy and internal energy are brought 
out in this, statement: a decrease in energy and an increase in 
entropy result in a decrease in free energy or thermodynamic poten- 
tial. These two tendencies have been expressed quantitatively in 
the relations showing that F and ® tend to a minimum. 

The formulated condition for phase equilibrium has numerous ap- 
plications. For example, using this condition, we can derive a rela- 
tion for the slope of a phase equilibrium curve. 

On such a curve, consider two points representing the external con- 
ditions T,, p, and T,, p,. The equilibrium conditions for these points 
have the form 


a, cae pP,) =, (T,, P,) and ®, (T,, p,)=, (F3 D2) 
The subscripts of © refer to the phases in equilibrium. Subtracting 
the first equation from the second, we obtain 
®, (T,, P.) —®, Gay P,) =, Caer p,) —®, (Ts ~,)- 


_ Let us assume that the two points are close to each other. Then, 
by means of the formula for the increment of a function of two 
variables, we can transform the above equation into the form 


6D, am, __ o@ oD 

oF dT + 55 dp= 57 aT + op oP: 
. Substituting the values of the derivatives of the function OD=U— 
-TS-+pv, namely, r= —S and ae we obtain 


dp _S,—S, AS 
aT” v,—v,—OUy — DQ 





But since AS= 40 » 
os oa (Clapeyron-Clausius equation). 


243580 
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Thus, the slope of the curve | the derivative a) is detérmined 
by the latent heat of fusion AQ, the temperature of the phase transi- 
tion T, and the difference in the volume of the phases. If AQ is posi- 
tive, this means that the subscript | refers to the high-temperature 
phase. 

Let us apply the Clapeyron-Clausius equation to the case of melting ice. 
When ice melts, 1 cm® of water is obtained from 1.091 cm? of ice. The volume 
change vij—ve2 is equal to —0.091 cm* (the volume decreases). In this case, AQ 
will be the heat of melting and is equal to 80 cal/gm. The temperature T equals 
273°K. Hence, 

aT TXAv__ (273)(—0.091)__ 0.3] deg cm?® 
dp AQ 80 ; cal 


Since the dimensions of the above result somewhat obscure its significance, 
let us convert calories to atmospheres, recalling that 1 cal=42.7 kg cm= 
42.7 atmem®. We obtain 


ef 2-6: 0075 2. 


dp atm ° 


Thus, increasing the pressure by 1 atm decreases the melting point of ice by 
0.0075 degree. 


246. Metastable States 


The above thermodynamic explanation of phase transition phe- 
nomena does not explain a number of observed facts. Thus, from the 
thermodynamic viewpoint, for a given p and T there can occur a sin- 
gie state (a point in one of the regions of a phase diagram) for which 
the free energy, or thermodynamic potential, assumes a minimum 
value. However, it is possible for graphite and diamond to exist side 
by side, and water can be obtained under the conditions for the. 
existence of ice (supercooled water). Numerous other examples in 
which the above thermodynamic principles are violated may be cited. 
The situation can be described as follows: In addition to states 
which are stable under given external conditions, so-called meta- 
stable states may aslo exist. 

The free energy of a metastable state is not a minimum, but nev- 
ertheless the transition from this state to a state having a minimum 
energy is impeded. Different metastable states may differ considera- 
bly in their degree of stability. Sometimes a slight impulse suffices 
for a transition to occur to a “normal” state, while in other cases a 
metastable state may be, in actuality, no less stable than a “normal” 
state. 

Various phase transformations can be delayed. Thus, water can be 
supercooled, i.e., at normal pressure, water may exist at a tempera- 
ture below 0°C; water can aslo be superheated, i.e., its temperature 
may be raised above 100°C without boiling. A vapour also may be 
obtained under atypical conditions (a supercooled vapour is said to 


Sec. 246] Metastable States 627 


be supersaturated). Transformation delays always occur in the solid 
state, i.e., the transformation of one crystal phase into a second is 
delayed even though the conditions prevailing are those for the sta- 
ble existence of the second phase. 

However, one type of transformation, namely, fusion, is never 
delayed. Thus, a crystal cannot exist under conditions that are sta- 
ble for the liquid phase. 

We frequently have occasion to deal with supercooled liquids. 
Liquids such as glycerine considerably increase in viscosity when 
supercooled and may remain in the amorphous state for months or 
even years. Glass is another example of a supercooled liquid. 

The existence of a metastable state can be demonstrated in the 
case of a supercooled liquid by bringing the liquid into contact 
with a crystal. In such cases, crystallisation begins immediately. 
If the liquid is highly supercooled, the effect will be extremely vio- 
lent. When a snowflake is thrown into supercooled water, ice need- 
les dart through the water in all directions and in a few seconds 
the transformation is complete. 

Delays of crystal-crystal transformations are particularly interest- 
ing. Here, delays can occur, so to speak, in both directions. Yellow 
sulphur should be transformed into red sulphur at 95.5°C .If sul- 
phur is rapidly heated, this transformation point may be “skipped” and 
the sulphur may be brought to fusion at a temperature of 113°. Now 
assume that the melt is gradually cooled. At 113° small crystals of 
red sulphur are formed. Cooling does not result in a transformation 
at 95,5°, and even at room temperature small crystals may exist for a 
considerable period of time. However, the transformation process pro- 
ceeds, even though slowly, and within a day is complete, i.e., a yel- 
low powder is obtained. Here, too, the metastable nature of the 
state is best demonstrated by dropping a small crystal into the melt. 

In certain cases, we are interested in a substance in a phase which 
might be expected to exist under entirely different conditions. An 
example of this is white tin which is transformed into gray tin when 
the temperature is reduced to 13°C. Usually, we are more interested 
in white tin and are cognisant of the fact that in winter nothing can 
be done with it, However, white tin excellently withstands 20-30% 
of supercooling and only under severe winter conditions does it 
begin to be transformed into gray tin. (The members of the Scott 
expedition to the South Pole perished as a result of ignorance of 
this fact. Liquid fuel taken on the expedition had been placed in con- 
tainers soldered with tin. At the extremely low temperatures prevail- 
ing, the white tin was transformed into a gray powder. As a result, 
the containers opened and the fuel was lost.) 

To explain transformation delays, let us consider the difference 
between liquid-crystal and crystal-crystal transformations on the 
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one hand and crystal-liquid transformations on the other. In the 
last case the long-range order of atoms disappears, while in the 
first two a long-rangeorder is created. The elimination of long-range 
order does not require a great effort. Fusion begins at the surface; 
atom after atom is torn away from its neighbours and falls out 
of strict order. 

On crystallisation, short-range order is transformed into long- 
range order. The process begins at the surface and must proceed in- 
wardly, i.e., into the substance. The atoms, or molecules, are “forced” 
to establish strict order under extremely crowded conditions. 
Their motions must be harmonised for order to be established. 
As we have seen, the rearrangement of atomic order, which 
requires that atoms undergo “organised” displacements from certain 
ordered positions to others, is all the more difficult. 

Transformations in the solid state always begin at the boundaries 
of grains, blocks, empty spaces, and at dislocations; in other words, 
wherever there is more freedom. If only several score atoms have 
occupied positions corresponding to a new order, oriented growth 
of the nucleus proceeds, i.e., one after another atoms begin to pass 
from the old, less favourable order, or in the case of crystallisation 
from disorder, to the new order. This is the effect of a crystal particle, 
or seed, which invariably puts an end to a supercooled state. 


247. Gas => Liquid Transformations 


Vaporisation consists in the separation of fast-moving particles 
from the surface of a liquid. Two conclusions immediately follow 
from this, namely, vaporisation increases with increasing tempera- 
ture and requires the application of heat. If the vaporised molecules 
are continuously removed from the surface of the liquid, the vapor- 
isation process continues until all of the liquid has been transformed 
into vapour. 

Let us consider vaporisation in a closed vessel. In such a case, not 
only do molecules separate from the surface of a liquid, but the re- 
verse process also occurs, namely, vapour molecules return to the 
liquid. The vaporisation process will continue until dynamic equi- 
librium corresponding to the given temperature has been estab- 
lished. Of course, the liquid may completely vaporise without equi- 
librium being established with the vapour. 

When equilibrium exists, we say that a vapour is saturated. The 
pressure of asaturated vapour is a function of the temperature and is 
given by the phase equilibrium curve. By changing the tempera- 
ture, we either vaporise more of the liquid in a vessel or condense 
some of the vapour. This results in a change of the vapour pres- 
sure, 
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It is clear why the density and pressure of a saturated vapour 

increase with temperature. The number of molecules leaving a liquid 
rapidly increases as the kinetic energy of the molecules increases. 
On the other hand, the number of vapour molecules returning to a 
liquid is almost independent of the temperature since such a process 
requires no energy. 
. The density of saturated vapour 
at a given temperature varies from 
substance to substance within a 
broad range. At room temperature, 
the density of saturated steam is 
equal to 13 mm, while that of sat- 
urated mercury vapour is only 
0.005 mm, 

Aclear picture of transition proc- 
esses from gases to liquids may be 
obtained by considering isother- 
mic compression of a gas, i. e., 
“vertical” processes in a phase dia- 
gram. In order to represent volu- 
metric changes, which are not 
shown in a phase diagram, let us 
draw an auxiliary diagram in which 
pressure is plotted as a function 
of volume (see Fig. 281). Fig. 281 

If gas compression occurs at a 
sufficiently low temperature, sooner or later we arrive at an inter- 
section point with a phase equilibrium curve. At this instant, the 
pressure is equal to that of saturated vapour at the temperature of 
the experiment and the first drops of liquid appear. As long as 
a vapour is not completely transformed into liquid, the compressing 
motion of the piston will not be accompanied by a change in pres- 
sure since we remain at the same point in the phase diagram from 
the beginning to the end of condensation. The condensation proc- 
ess will be indicated by a horizontal line on the pressure-volume 
curve. 

The significance of points of a rectilinear segment on the diagram 
is clear. They describe a two-phase liquid-vapour system. Each 
point of such a segment corresponds to a definite ratio betwee the 
phases, which can be easily determined by means of a “lever” rule. 
Let us designate the volume of the liquid by v,, the volume of the 
vapour by vy, and the proportion of substance in the liquid state by 
x. Then, the volume of the wet mixture is given by 





v= xv, + (J — x) z,. 
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Hence, 
Ug—v 
Up,—0," 





x= 


This is the “lever” rule. 

When the condensation process is completed, a steep rise occurs 
in the curve since liquids have very low compressibility. 

Now, let us increase the temperature to that of the next isotherm 
on the diagram. It will be practically the same as the first, except 
for one important difference, namely, condensation begins later 
since the pressure of a saturated vapour is greater at a higher tempera- 
ture. Moreover, condensation is completed earlier since the piston 
does not reach its preceding position owing to the thermal expansion 
of the liquid. 

By increasing the temperature further, we obtain a series of iso- 
therms in which the horizontal segments corresponding to two-phase 
systems become shorter and shorter. Finally, this segment disap- 
pears entirely. The decrease in the length of the horizontal segment 
indicates that the specific volume of the liquid is approaching that 
of the vapour. At a certain critical temperature, these volumes be- 
come equal and the isotherm no longer has a horizontal portion. In 
the pressure-volume diagram, the critical point is easily determined 
as the apex of the dotted two-phase region. In a diagram of state, 
the critical point is located where the liquid-vapour phase equilib- 
rium curve breaks off. 

As the temperature is increased, the isotherms resemble broken 
curves less and less and gradually approach the hyperbolas of an 
ideal gas. 

The existence of a critical point indicates that we were justified 
in stating that there is no basic difference between a gas and a liquid. 
We see that if we by-pass the critical point a transition from a liquid 
state to a gas state can be achieved without going through a phase 
transformation. 


248. Liquefaction of Gases 


We cannot obtain a liquid by compressing a substance the temper- 
ature of which is above the critical point. When such a substance is 
highly compressed, it becomes very dense, with its molecules coming 
into close contact with one another. Nevertheless, a liquid in the 
usual sense of the term cannot be obtained. Thesubstance cannot be 
poured into a glass like an ordinary liquid, i.e., its state has no dis- 
tinctive form. This is due to the fact that the phase equilibrium 
curve was not crossed during compression. The absence of such a cross- 
ing indicates that a two-phase liquid-gas system cannot be obtained. 
This means that we cannot get a liquid with a definite form; instead, 
the liquid fills the entire volume available to it. 
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Compression must take place at temperatures lying below the crit- 
ical point if a gas is to be liquefied. This is easily achieved if the 
required temperatures are reached by thermal exchange with cold 
bodies. However, in the case of oxygen, nitrogen and hydrogen, this 
is not possible since the critical temperatures are very low. In order 
to liquefy these gases, we must resort to Joule-Thomson or adiabatic 
cooling. 

In the former case, the gas is compressed by means of a compressor 
and passed through a refrigerator. Then, the gas enters a spiral 
tube and is allowed to escape through an aperture, which serves as 
the porous plug (partition) in the Joule-Thomson experiment (see 
p. 173), to a region of lower pressure (atmospheric). Upon expanding, 
the gas cools, rises to the top, and cools the spiral tube. Thus, each 
successive portion of escaping gas will be colder than the preceding 
one. Finally, a temperature is reached at which the gas is trans- 
formed into liquid. 

The other method of liquefying gas involves the use of an expander. 
In a reciprocating expander, a gas is expanded adiabatically, per- 
forming work in moving a piston, and leaves the cylinder at a lower 
temperature. By having each portion of gas cool the subsequent one, 
one can reduce the temperature to —150°C. Further cooling is imped- 
ed by the absence of suitable lubricants to maintain the friction 
between the piston and the cylinder walls at a low level. Asolution 
to this problem was found by P. L. Kapitsa, who developed a refrig- 
erating turbine—a turboexpander. A turbine is rotated by means 
of the gas from a compressor. The gas expands adiabatically, is 
cooled, and cools the subsequent portion of gas. Difficulties of 
lubrication are overcome by placing the bearings, requiring lubri- 
cation, external to the cold region. 


249. Gas == Crystal Transformations 


“ 


When it is said that a substance “ vaporises”, the reference is 
usually made to the vaporisation of a liquid. The vaporisation of 
solids ‘is called sublimation. One of the most familiar examples of 
evaporation of a solid is the sublimation of naphthalene. 

Every odorous solid sublimates to a significant extent. The odour 
is produced by the molecules separating from the substance and 
reaching our olfactory organs. Usually, however, a substanceswill 
sublimate to an insignificant extent. Sometimes, sublimation may 
not be detected even by very careful investigation. But, in prin- 
ciple, all solids, including iron and copper, vaporise. If sublimation 
is not detected it simply means that the density of the saturated 
vapour is extremely low. That this should be so is quite natural. The 
motion of atoms and molecules in a solid is very ordered and there 
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is little probability that a molecule will separate from the surface 
of the solid. 

The density of a saturated vapour in equilibrium with a solid in- 
creases with increasing temperature. It can be shown that a number 
of substances having a strong odour at room temperature do not 
manifest it at a reduced temperature. In most cases, the density of 
the saturated vapour of a solid cannot be increased significantly 
simply because the substance melts first. 

Vapours are frequently used to obtain crystals when the latter are 
required in very pure form. This can be accomplished, for example, 
by precipitation on slightly cooled glass. 


250. Liquid Crystal Transformations 


The transition from a liquid to a solid state (crystallisation) and 
the reverse transition (fusion) involve a fundamental rearrangement 
of particles. Upon fusion, long-range order in the arrangement of 
molecules or atoms disappears. 

For a given pressure, fusion occurs at a very definite temperature. 
The vibrations of molecules or atoms become so intense that the 
maintenance of long-range order becomes impossible. 

If the temperature is maintained at the fusion point, a liquid and 
a crystal may remain in a state of equilibrium, like in the case of a 
liquid and a saturated vapour. Crystals will neither grow nor melt. 

External pressure changes the fusion temperature. As arule, the 
fusion temperature increases with pressure, i.e., fusion becomes 
more difficult. However, there are several exceptions to this rule. 
One of these is ice. The melting of ice is facilitated by an increase 
in pressure. In terms of a phase diagram, we can briefly describe the 
none) and anomalous behaviour of bodies as follows: Usually, 


0, i.e., the equilibrium curve forms an acute angle with the 


temperature axis. In the anomalous case, a <0 and the curve forms 


an obtuse angle with the abscissa. The strange behaviour of ice is 
related to another anomaly, namely, ice is lighter than water. The 
relationship of these two anomalies is shown in the following equa- 


oP Fu, Q The overwhelming 


majority of solids are denser than their ‘iands: Evidently, under 
such conditions, a pressure which produces packing should facili- 
tate fusion. The relationship between the two anomalies is quite 
natural. Consider a liquid and a crystal in a state of phase equilibri- 
um. Let us raise the pressure without changing the temperature. 
The atoms should approach one another. If the solid is denser, the 


tion, which was derived above: 
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liquid is transformed into.a crystal state. If the liquid is denser, the 
reverse transition occurs. 

The anomalies of water play an extremely important role in our 
lives. If they did not exist, rivers would freeze at the bottom. The 
anomalies of ice and other such bodies are due to the structures of 
these bodies. Ice crystals do not conform to the law of compact 
packing of particles. Thus, a disturbance of long-range order 
results in an increase in density rather than a decrease, as is usually 
the case. 

Let us return to Fig. 264 (seep. 598). The broad ice channels may 
displace molecules of ice by expanding somewhat. When ice melts, 
molecules may “fall” into this channel. In such a case, of course, the 
density will increase. No theory exists by means of which the heat 
of melting or the melting temperature could be predicted. This is 
due to the dependence on a great many structural factors. To be 
sure, the heat of melting is easier to predict, since the melting 
temperature is equal to the heat of melting divided by the entropy 
of melting. 

A measure of the binding forces between molecules or atoms is, 
of course, the heat of sublimation (the energy required to break the 
intermolecular bonds) rather than the heat of melting (the energy 
required to eliminate long-range order). 

As indicated above, fusion of crystals cannot be retarded. On the 

other hand, crystallisation can be retarded and, in fact, sometimes 
will not occur at all. In order for crystallisation to begin in a 
liquid, there must appear a nucleus, i.e., asystem consisting of sev- 
eral scores of atoms or molecules which have assumed an arrange- 
ment corresponding to that of acrystal of the substance. Moreover, 
conditions in the liquid must be favourable for the growth of this 
nucleus. In most liquids, it is difficult to achieve a significant re- 
tardation in the process of formation of nuclei. Cooling under strict 
conditions is required to achieve such retardation. Dust particles 
must be prevented from falling into the liquid and all mechanical 
disturbances such as vibrations and the jarring of the vessel contain- 
ing the liquid must be avoided. 
- At sufficiently great supersaturation, it is probably impossible to 
avoid the spontaneous formation of nuclei, i.e., to avoid the stabi- 
lisation requisite for the crystallisation of atomic or molecular groups. 
But something else may occur. When the temperature is decreased, 
the mobility of the particles may decrease to such an extent that 
the rate of growth of crystalline nuclei approaches zero. That is how 
glass is formed. Crystals will invariably grow when a few crystal par- 
ticles (seeds) are to be found in a liquid under conditions of ther- 
modynamic’ equilibrium with a crystal phase. Crystals for indus- 
trial purposes are grown by means of seeding. 
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If heat is removed very slowly, i.e., the temperature decreases a 
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mechanism of explaining growth_is most applicable. It i 
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It even has been possible to grow a single crystal of a hexagonal 
phase from a single crystal of a cubic phase by means of transforma- 
tions of this kind. Usually, this is not possible since the growth of 
crystals of a new phase begins simultaneously from many centres 
and a monocrystal becomes transformed into a fine crystalline sub- 
stance. In most cases, a crystal crumbles when it is transformed into 
another solid phase. Sometimes the outer “shell” of a polyhedral 
monocrystal is preserved and a fine crystalline substance occupies 
this perfectly symmetrical volume. 

The reason for the difficulty is clear. Crystals of anew phase may be- 
gin to grow from various points. But close layers in a cubic face-cen- 
tred lattice may be formed by four differ- 
ent systems. Let us return to Fig. 261 
(p.596).In the crystal shown in this figure, 
close planes are perpendicular to the spa- 
tial diagonals, of which there are four in 
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Fig. 284 


a cube (eight corners). Thus, hexagonal crystals of four different 
orientations may grow from a crystal with a cubic packing arrange- 
ment. G. V. Kurdyumov’s work, devoted to transformations of 
iron and steel, laid the basis for the study of the rearrangement of 
atoms in phase transformations. At high temperatures, iron exists 
in the form of a compact cubic packing of atoms. At low tempera- 
tures, the iron atoms become arranged in a body-centred lattice. 
This transformation, known as a Martin transformation, is of tre- 
mendous importance in metallurgy* and, therefore, should be con- 
sidered in greater detail. 

In Fig. 284, we see what occurs when the temperature is increased. 
The left diagram again shows a compact cubic packing arrangement; 
the right diagram shows a body-centred packing arrangement drawn 


* The hardening of steel is nothing more than a Martin transformation. 
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in a rather unusual form: we see the projection of an arrangement of 
atoms as it appears when viewed along a plane diagonal of a cube. 
It would seem that these two diagrams have little in common, the 
main difference being that the left diagram represents a three-sto- 
reyed structure and the right a two-storeyed structure (the trian- 
gles are in the second storey). Less important is the difference in the 
angles of the rhombuses (not shown in the figure). As the temperature 
is increased, the atomic vibrations increase and the less compact 
body-centred lattice becomes less advantageous at a temperature 
of 906°C. The two-storeyed structure becomes transformed into the 
three-storeyed one by the alternate shifting of the layers marked by 
triangles. For example, the odd layers shift to the left and the even 
ones to the right. This shift occurs along the diagonal of a rhombus; 
the angle of the rhombus changes at the same time. 

When a phase transformati:n occurs in iron, crystal particles of 
the new phase may become oriented in any of 24 different directions. 
The number 24 is obtained in the following manner. There are 
four close planes in a cubic face-centred crystal and, as can be 
easily shown, a crystal of the new phase grows in six different direc- 
tions in a close layer. 

Undoubtedly ordered, regular processes play an important role in 
the transition from one order to another. In such a rearrangement 
of order, atoms do not have to interchange places, i.e., only an or- 
ganised shifting of atoms occurs. This is what takes place in a Mar- 
tin transformation, a transformation which does not involve diffu- 
sion. However, in other transformations in a solid, diffusion phe- 
nomena may play an important role. 


252. Diffusion in Solids 


It has long been known that foreign atoms diffuse in a solid. The 
surface layer of steel may be impregnated with carbon (cementation), 
nitrogen or boron. Diffusion occurs to a great depth and it is not par- 
ticularly difficult to follow the process. Atatemperature of 200-300°C, 
significant quantities of silver penetrate lead to a depth of several 
centimetres in an hour. 

However, not only foreign atoms migrate in a crystal. An iron atom 
can migrate in a crystal of iron and a copper atom in a crystal of 
copper. If a piece of radioactive copper is pressed against an ordi- 
nary piece of copper, the latter will soon become “contaminated” 
(radioactive). By means of tagged atoms, one can study the diffusion 
of atoms of the same kind, as well as of “foreign” atoms. 

Diffusion is possible as a result of thermal vibrations. When an 
atom: leaves its equilibrium position, a neighbour ‘takes its place, 
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Upon returning, the atom occupies the vacated spot. Thus, atoms 
interchange positions. Such an interchange is, of course, not easily 
achieved if only two atoms participate in the process. When two atoms 
interchange positions in a solid, a whole group of atoms are involved. 
An atom slips forward only when the thermal vibratiofs of many 
atoms accidentally create favourable conditions for this. 

Dislocations, empty spaces and fractures, which always exist in 
a crystal, play an important role in diffusion. The presence of an 
empty space in a crystal facilitates the step-by-step migration of 
an atom through the lattice. An atom hindering this migration is 
“pushed” into the empty space. 

If a foreign atom is not very large, it may move through the lat- 
‘tice without interchanging positions with lattice atoms. When the con- 
ditions become favourable, such an atom slides from one empty space 
in the compact packing of spheres to the next. 

Diffusion is a two-way effect. If a zinc plate is pressed against a 
copper cne, zinc atoms will penetrate the copper and copper atoms 
will penetrate the zinc. To be sure, the rates of flow in the two direc- 
tions may vary considerably. 

The diffusion of atoms through a crystal depends on many factors. 
It is interesting that a diffusion process proceeds most rapidly 
when the foreign atoms differ in all respects from the atoms of 
the crystal through which they move. Diffusion proceeds most 
slowly for atoms which are the same as these of the crystal or in 
the same column of the Mendeleyev periodic table as those of 
the crystal. 

As already indicated, the presence of fractures and dislocations 
facilitate diffusion. Therefore, diffusion proceeds most rapidly in 
a deformed metal. The rate of diffusion is greatly dependent on tem- 
perature. This is not surprising since the diffusion coefficient, the 
coefficient of proportionality between flow of matter and concen- 
tration gradient, can always be represented by an expression of the form 


Ae- UAT, 


where U is the height of the potential barrier which an atom must 
surmount in an elementary diffusion event. The existence of such 
a relationship is rather evident since the diffusion coefficient must 
be proportional to the number of atoms the energy of which suffices 
to cross the potential barrier. 

Such barriers are quite high. In the caseof self-diffusion, they are 
usually about 1-2 ev. It will be recalled that kT at room temperature 
is equal to ~ 0.03 ev. The number of atoms with energies that much 
greater than the average is very small; hence, practically no diffusion 
occurs, At a temperature of the order of 1,000°C, the situation is 
entirely different. 
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In the preceding article, we discussed crystal-crystal transforma- 
tions occurring in an organised manner without diffusion. It should not 
be assumed that all phase transitions occur this way. On the contra- 
ry, at a sufficiently high temperature, the interchange of positions 
by atoms begins to play a very important role and the organised 
nature of transitions will be achieved in only small regions, or may 
even be completely obscured by the interchange of positions 
by atoms. 

If a solid state transformation is of a diffusive nature, it proceeds 
at a rate that is comparable to that of self-diffusion processes. The 
heights of potential barriers surmounted by atoms during rear- 
rangement are of the same order as during self-diffusion processes. 

In the case of organised displacement of atoms of the Martin 
transformation type, the transformation proceeds at a ten-fold 
tate at low temperatures. 


CHAPTER XXXIV 


DEFORMATIONS OF BODIES 


253. Elastic Properties 


For every solid, there exists a distorting force limit up to which 
a deformation is elastic. This means that if the elastic limit is not 
exceeded, the body returns to its original state. 

Elastic deformations, like other deformations, are associated with 
the displacement of atoms (or molecules). When a body is extended 
elastically the interatomic spacing increases and when it is com- 
pressed the interatomic spacing decreases. 

The distinctive feature of elastic deformations is that they do 
not destroy interatomic bonds or create new ones. 

When a crystal is deformed elastically, all of the atoms continue 
to have the same neighbours. Thus, in elastic displacement, the lat- 
tice of a crystal as a whole becomes deformed (sloped). Hence, each 
atom continues to have the same neighbours. This enables the body 
to return to its equilibrium state when the distorting force is re- 
moved. 

The change in interatomic spacing that may be achieved by means 
of elastic extension or compression is quite small. The maximum rel- 
ative elongation of an elastic nature does not, as a rule, exceed 
0.001. This means that for interatomic spacings of the order of 
2A the equilibrium positions of the atoms may be displaced by no more 
than 0.002 A. Such small changes in a lattice period can be detected 
through X-ray analysis by observing the displacement of diffraction 
lines on a roentgenogram. This requires that the lines be filmed 
at large 6 angles, since only in this manner can small changes in 
interplanar spacings be detected (see p. 389). 

The elastic deformation of polymers such as rubber is of an entire- 
ly different nature. Rubber has mechanical properties which basi- 
cally differ from those of crystalline substances. The fundamental 
difference lies in the magnitude of elastic elongation. Certain 
kinds of rubber may be stretched to 10-15 times their normal length 
without exceeding the elastic limit. Thus, they may be elongated 
10,000 times more than metals! The magnitude of the modulus of 
elasticity of rubber is no less striking. 

A steel wire having a cross-section of 1 mm? will stretch one twen- 
ty-thousandth of its length under the action of a load of | kg, but 
a rubber band having the same cross-section will stretch to twice 
its original length under the action of such a load. 
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Two processes occur when polymers are stretched. First, tangled 
bundles of molecules become disentangled. At the same time, there 
occurs regular packing of certain portions of the disentangled bun- 
dles of molecular chains into a three-dimensional order. Evidently, 
the fact that crystallisation takes place upon stretching is of second- 
ary importance, since the established order does not remain 
when the external force is removed, i.e., the bundles of molecules 
become twisted once again. 

The twisting of bundles of molecules is accompanied by an in- 
crease in entropy, i.e., an increase in the degree of disorder. It turns 
out that the internal energy of rubber and similar polymers practi- 
cally does not change when such substances are elastically de- 
formed. Therefore, the work of stretching, which according to the fur- 
damental laws of thermodynamics isdA =dU—TdS, in this case sim- 
ply equals —TdS, i.e., it is directly proportional to temperature. 
(It should be recalled that the work of external forces on a system is 
considered to be negative.) In this respect, the elastic deformation 
of rubber is of the same nature as the isothermal compression of a 
gas (cf. p. 176). 

It should be noted that both the elastic deformation of a crystal 
and that of rubber do not yield a new potential energy minimum. 
In the case of a crystal, this is due to the fact that the energy does 
not change at all. 


254. Plastic Properties 


Slippage. The elastic deformation of a crystal consists in the chang- 
ing of its interatomic spacing with each atom maintaining its same 
neighbours. On the other hand, in the case of a plastic deforma- 
tion—a deformation which remains when the external force pro- 
ducing it is removed—atoms surmount their potential barriers and 
enter new “potential wells”, i.e., change their neighbours. The basic 
mechanism of plastic deformation is the slippage of one atomic plane 
relative to another. An element of such slippage consists in the dis- 
placement of all of the atoms by one period. This can be detected 
with the naked eye in the form of so-called slip bands. Slippage 
occurs at the weakest points (fractures and other defects) and the 
crystal breaks up into layers (slip stacks). The plotting of stack thick- 
nesses, the order of magnitude of which is equal to several tenths 
of a micron, yields a random distribution curve. The forces required» 
to displace atomic planes having different indexes will differ. Usual- 
ly, it is-easiest-to displace the planes which are most compactly filled 
with atoms: However, slip planes of crystals may change with changes, 
in-.temperature arid impuritiés and also during ‘the “deformation 
process itself. In aluminium, the plarie (111) is ‘a slip plarié. 
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Slippage occurs along a given plane having a definite orientation. 
Usually this plane has the densest distribution of atoms (e.g., [101] 
in an all-sided face-centred cubic lattice). 

In order for displacement to begin, a certain minimum stress 
(ultimate shearing stress) is required. The magnitude of this stress 
is very small, reaching several grams per square millimetre in 
some cases. In measuring the ultimate stress, one must, of course, 
take into account the orientation of the slip plane relative to the ex- 
ternal force. 

Strength. A monocrystal of zinc can be easily bent by hand. How- 
ever, it is not possible to straighten it in the same manner. This 
is due to the fact that its strength has increased. : 

The resistance of a crystal to displacement increases with increas- 
ing deformation. Therefore, plastic displacements along a given slip 
plane do not cause the material to rupture, but rather cease when the 
strength is sufficient to oppose the external force, and thereupon dis- 
placement begins in other planes. Thus, the number of slip bands 
increases and the slip stack thicknesses decrease. 

If an external force is applied to a crystal previously subjected 
to plastic deformation, such deformation will resume, of course, when 
the magnitude of the force reaches the value at which it previously 
ceased to be effective owing to an increase in strength. It can be stat- 
ed, therefore, that an increase in the strength of a crystal increases 
its elastic limit—and, moreover, by a large factor. 

One theory relates the described increase in the strength of a crys- 

tal to a disturbance of the regularity, i.e., distortion, of its lattice. 
From this viewpoint, it is quite natural that thestrength of a crystal 
should increase with increasing rate of deformation and decrease 
with increasing temperature. However, from the viewpoint of the 
dislocation theory discussed above, the picture is different. 
- Plastic Deformation as a Displacement of Dislocations. Let us 
consider in greater detail the process of displacing one atomic plane 
relative to another. If there are no dislocations in a slip band, it is 
necessary to shift every row of atoms in the displacement plane. The 
situation is quite different when a displacing force acts on a crystal 
containing dislocations. ; 

Fig. 285 shows a compact packing of spheres which contains 
a simple dislocation (only the end spheres of the rows are shown). 
For simplicity, let us assume that the dislocation region embraces 
a minimum number of rows. Then, the dislocation consists basically 
in the following: Between two rows of the upper, extended layer, 
adjoining the boundary between blocks, there is a linear gap. In the 
lower, compressed layer on the other side of the boundary between 
blocks, there is an extra row of atoms (the two rows of atoms just 
below the linear gap are very compressed). Now, let us begin to dis- 
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place the upper block to the right relative to the lower one. 
At a certain initial instant, the “fissure” is between rows 2 and 3; 
rows 2’and 3’ are compressed. As soon as the force becomes 
effective, row 2 moves into the “fissure’, the shape of sphere 
3’ is restored, and sphere /’ becomes compressed. The en- 
tire dislocation has shifted to the left, and it will continue to move 
in this direction until it is “pushed out” of the crystal. In other words, 





Fig. 285 


displacement consists in shifting the dislocation line along the dis- 
placement plane. It is clear that a much smaller force is required 
to achieve displacement when dislocations are present. 

According to calculations, the strength of a crystal in which there 
are no dislocations should be a hundred times as great as the strength 
of an actual crystal, determined experimentally. The presence of 
a small number of dislocations suffices to decrease the strength to 
a small fraction of that of an ideal crystal. 

Fig. 285 shows how a dislocation is “pushed out” of a crystal by 
an applied force. Thus, as the degree of deformation is increased, the 
strength of a crystal increases. When the last dislocation of a crystal 
is eliminated, its strength will be about a hundred times as great 
as that of a perfectly normal crystal. In this manner, an increasein 
strength can be easily explained. To be sure, in order to obtain quan- 
titative agreement between calculations and experimental results? we 
must assume that special as well as ordinary dislocations may aid 
displacement. 

Excellent confirmation of this theory is provided by the fact that 
the strength of perfect crystals which are grown artificially is ap 
proximately equal to the calculated value for an ideal crystal. 
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255. Ultimate Strength 


As the stress within a body increases, its deformation increases 
up to a certain point (the elastic limit). Then, plastic deformation 
begins and because of an increase in strength the curve rises sharp- 
ly. Finally, the rupture point is reached. 

The ultimate strength of a body depends, to a certain extent, 
on the duration of an applied force. When the force is of prolonged 
duration, the ultimate strength drops. The values given in engi- 
neering handbooks generally refer to short-duration tests. The de- 
pendence of the ultimate strength of a body on the duration of an ap- 
plied force indicates that in slow processes the behaviour of a solid 
depends on internal diffusion processes. It may be assumed, for exam- 
ple, that even small forces can create a favourable trend for proc- 
esses involving an interchange of positions by atoms. 

The value calculated for the ultimate strength of a perfect crys- 
tal is several hundred times as great as measured values. This sig- 
nifies that faults play a fundamental role in a crystal. The calculat- 
ed value for the rupture strength of a monocrystal of rock salt is 
200 kg mm?. Under ordinary conditions, arod of this substance will 
rupture when it is subjected to a load of 0.5 kg/mm’. 

The roie played by fractures was demonstrated in the well-known 
experiments of A. F. Yoffe, who investigated the rupture of rods 
of rock salt in water. Water dissolves the surface of such a rod and 
“heals” the microfractures formed during extension. As a result, 
the resistance of rock salt to rupture becomes greater than 100 kgijmm’, 
i.e., it approaches the theoretical value. 

The presence of faults serves to decrease the effective area at which 
rupture occurs. In the final analysis, the force acting at the instant 
of rupture is determined by the number of broken interatomic bonds. 

The strength of various materials determined in this manner vary 
within narrow limits. Thus, a piece of thread or rubber band rup- 
tures at a stress of the same order of magnitude as that of a steel wire. 


256. Mechanical Properties of a Polycrystalline Material 


During the initial, elastic stage of deformation, the grains of a 
crystalline material are not deformed uniformly since they have 
various orientations relative to the line of action of the force. As a 
result, the elastic properties of a polycrystalline substance will differ 
from those of a monocrystal. 

However, in the plastic region, the behaviour of a polycrystalline 
substance may differ from the behaviour of a-monocrystal of the same 
substance to an even greater extent. A polycrystalline substance 
offers greater resistance to an external force. This is not surprising 
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since the development of plastic displacements in a given grain will 
be impeded by neighbouring grains in which slip planes are oriented 
entirely differently relative to the applied force. 

Moreover, in a polycrystalline material, there occurs an entirely 
new phenomenon: the turning of grains and the formation of tex-. 
ture. The turning of grains in drawing, rolling and other deforma- 
tion processes is determined by the tendency of each grain to become 
aligned in such a manner that slippage is facilitated, i.e., with its 
slip plane parallel to the applied force. If there are several slip planes, 
the grain assumes a position for which the effect of the slip planes 
is maximal. For example, in most metals having cubic all-sided 
face-centred cells, the grains tend to become arranged with the [111] 
directions parallel to the axis along which the material is drawn. 

Another new phenomenon occurring in such a material consists 
in the slippage of the grains relative to one another along an 
intercrystal layer. Sich displacement differs from crystal displace- 
ment and is more like viscous flow in thick liquids. 

A polycrystalline material may not rupture at the same value of 
stress as a monocrystal. In certain cases, the grains of a polycrystal- 
line material remain whole when the material ruptures. This occurs 
when intercrystal layers have weak mechanical properties. As a gener- 
al rule, the strength of a material increases with decreasing grain 
size. 


257. The Effect of Surface-Active Substances on Deformation 


Every solid has numerous ultramicroscopic structural faults, 
arising as the result of the thermal mobility of its atoms and the pres- 
ence of impurities and mechanical defects. These faults are distrib- 
uted throughout the volume of a solid and in many cases can be 
viewed as extremely minute embryonic microfissures. The basic 
characteristic of such embryonic microfissures is their ability to in- 
crease in size during the process of deformation of the material. 

Under the action of external forces, the size of these microfissures 
increases and a concentration of stress arises at their edges. This, 
in turn, further facilitates the growth of the microfissures. In the 
case of a brittle material, such an increase in the size of microfissures 
during a deformation process may result in premature rupture of 
the material. In the case of a plastic material (most metals) such an 
increase in the size of microfissures during a deformation process 
results in the formation of plastic displacements. When a body is 
in a three-dimensional stressed state, the microfissures have wedge- 
shaped cross-sections and are characterised by exposed surfaces— 
orifices and cul-de-sacs in which the fissures preserve their embry- 
onic nature. Actual fissures terminate in cul-de-sacs like a sharp 
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blade having a very large curvature, with a radius of curvature of 
the order of magnitude of the lattice spacing. When the deforming 
forces are removed—in the range of elastic deformations—the micro- 
fissures gradually “heal” in reverse order, i.e., first the cul-de-sacs 
close and then the orifices. 

P. A. Rebinder has shown that the effect of the surrounding medium 
of a solid on its mechanical properties is not restricted to such cherni- 
cal action as corrosion and dissolution. The exposed surface of a sol- 
id is always coated with a thin film of a component of the surround- 
ing medium which is most closely related to the given solid. Such 
a substance may be a gas or vapour usually contained in air, or a sub- 
stance located in the vicinity of the solid. The molecules of a sub- 
stance clinging to the surface of a solid, or, as we generally say, ad- 
sorbed by a solid, are able to move along this surface and migrate 
from a region where there is an excess of such molecules to a region 
where there is a deficiency for complete coating of the surface. The 
tendency of an adsorbed layer to occupy all of the surface available 
to it is due to the fact that adsorption decreases the surface energy 
of a solid. Substances which may be adsorbed by the surface of a © 
solid are called surface-active substances. Various organic alcohols, 
acids, and salts of these acids, i.e., soaps, are highly surface-active 
substances with respect to metals. 

The strength of a solid decreases when it adsorbs a auinface: active 
substance. If a solid is ruptured in a medium containing even a 
small amount of a surface-active substance (for example, in a so- 
lution of oleinic acid in pure vaseline oil), the force required to rup- 
ture the material is less than under ordinary conditions. This is 
particularly evident in the crushing of soft rocks and in the subjec- 
tion of metals to a variable force or a force of prolonged duration. 

The effect of adsorbed molecules on the strength of a solid can be 
explained as follows. When molecules are adsorbed by the surface of 
a body, they penetrate the microfissures as a consequence of their 
mobility and tendency to occupy all of the exposed surface of the ad- 
sorbent. The drawing of adsorbed layers into a microfissure is due to 
the decrease in the surface energy of a solid caused by such penetra- 
tion. If an obstacle is placed in the path of an adsorption layer tend- 
ing to occupy a surface area of a solid which is not yet occupied, 
the adsorption layer will exert pressure on the obstacle. Within a 
microfissure, such an obstacle is provided by the molecules them- 
selves, i.e., their size prevents them from penetrating deeper into 
the microfissure. Therefore, at the boundary of an adsorbed layer 
within a microfissure, a pressure, which is directed so as to increase 
the size of the fissure in depth, arises. Adsorbed layers behave like 
wedges driven into the microfissures. Thus, the penetration of ad- 
sorbing molecules into the orifices of microfissures tends to create 
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additional disrupting forces. This is equivalent to increasing the 
external deforming forces, Therefore, the rupture of a solid in 
the presence of adsorbing substances is brought about by lower 
applied forces. 


258. Material Breakdown under the Action of a Stream of Particles 


The problem of material breakdown under the action of a stream 
of particles is of great importance in the construction of nuclear 
reactors. The materials of a reactor, including nuclear fuel, modera- 
tor, walls and instruments, are subjected to the action of neutrons, 
fission fragments, electrons, etc. Let us consider those actions of 
streams of particles which leave permanent effects. 

In the first place, we should mention particle collisions in which 
an electron providing a chemical bond between atoms is dislodged 
from its position. In such cases, ionisation results in the rupture 
of the bond. This bond is not necessarily re-established. Moreover, 
the ions or radicals which areformed may recombine in a different 
way. Therefore, in molecular materials, ionisation results in the 
breakdown of certain molecules and the creation of new ones. 

An atomic nucleus may also be displaced from its position. In 
such a case, it drags along its electronic cloud. Therefore, it can be 
said that an entire atom, rather than just a nucleus, has been dis- 
lodged from its position. Such an effect of the action of radiation is 
almost always irreversible. 

Materials are damaged by radiation as a result of the displacement 
of atoms from their positions and the rupture of chemical bonds. 
Atoms are dislodged under the action of heavy charged particles and 
fast neutrons. Chemical bonds are ruptured under the action of slow 
neutrons, y-rays and electrons. 

Let us consider more carefully what occurs when atoms are dis- 
lodged from their positions in solids. The process of displacement 
of atoms is a chain process. This means that the first displaced atom 
displaces another atom which is located in its path; the latter is 
able to. displace a third atom, etc. By means of such a chain process, 
a single fast nuclear projectile is able to produce considerable distor- 
tion in the crystal lattice of a solid. The nature of the distortion 
varies considerably from case to case. A crystal lattice may be com- 
pletely destroyed. Foreign atoms may penetrate between atoms of the 
primary lattice. Also possible are processes involving the substitu-s 
tion of atoms of the primary lattice by projectile atoms. The num- 
ber of displacements per charged particle for one element does not 
differ greatly from the number for another element. An a-parti- 
cle having an energy of 5 Mey, or a proton having an energy of 
20 Mev, dislodges 60-80 particles. 
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These figures suggest that the heavier the particle, the greater the 
damage. Analogous figures for fission fragments of uranium-235 
or plutonium-239 nuclei are considerably more unexpected. Such a 
pair of fragments produces, for-example, 25,000 displaced atoms in 
uranium and 8,300 displaced atoms in graphite. 

The process of slowing down a neutron from its initial velocity 
to thermal velocity also does not proceed without causing damage 
to materials. The slowing down of a neutron causes 450 atoms to be 
displaced in beryllium, 1,870 atoms in graphite and 6,030 atoms in 
aluminium. 

It is evident from these figures that significant changes in the prop- 
erties of a crystal lattice are to be expected as a result of the dis- 
placement of atoms from their positions in the lattice. Metals are of 
primary interest in this connection. The reason for this is that the 
sole permanent effect of radiation in metals is the displacement 
of atoms. 

The action of neutrons and fission fragments has been studied most 
carefully. This is not surprising since such investigations are of bas- 
ic importance in the design of nuclear reactors. The effect of a dose 
of 10’* neutrons per sq cm has been studied in detail. Such a dose is 
not very large. Generally speaking, the materials of anuclear reactor 
are subjected to such a stream of neutrons during each day of opera- 
tion. However, even in the case of such a small dose, the properties 
of metals undergo important changes. These changes approximate 
those which occur in the cold working of metal. Thus, under the action 
of neutrons and fission fragments a metal’s brittleness and hardness 
increase, its ductility decreases, and its electromagnetic properties 
also change. : 

Radiation damage in metals consists mainly in the displacement 
of atoms from their positions, but in organic materials, where the 
atoms are connected by chemical bonds, the changes consist mainly 
in the rupture of such bonds as the result of ionisation. Organic ma- 
terials are very rapidly broken down under the action of radiation. 
A. dose of the order of 10'® neutrons per square centimetre practical- 
ly disintegrates an organic substance. In a reactor, paraffin, olefin 
and polypheny! sustain a damage of 25 per cent within several hours. 

The transformation of an organic material usually consists in the 
liberation of a gas and in polymerisation. However, it should be nat- 
ed that in certain cases high-polymer materials are depolymerised 
under the action of radiation. 


CHAPTER XXXV 
DIELECTRICS 


259. The Relationship Between Permittivity 
and the Polarisability of a Molecule 


In a number of cases, particularly in gases, the molecules of a 
substance do not interact with one another. Hence, the electrical 
properties of such a substance are determined by the average behav- 
iour of one of its molecules. Molecules do not interact with one another 
in many dilute solutions as well. Occasionally, molecular interaction 
plays a secondary role even in a concentrated solution. 

Therefore. consideration of the electrical properties of a substance 
which consists of a large number of noninteracting molecules is of 
considerable importance. 

The dipole moment of a unit volume of dielectric, P, is determined 
by the permittivity e and the field intensity E in accordance with 
the formula 





e—l 
P= in E 


(see p. 253). On the other hand, the polarisation vector P is equal to 
the sum of the dipole moments in a unit volume of dielectric: 


P=ip 
or 
P=Np, 


where N is the number of molecules in such a unit volume and p 
is the “contribution” of each molecule to the polarisation vector. 
If E’ is the field intensity which acts on a molecule, then 


p=BE’, 
where f§ is the polarisability of the molecule. 

- It would seem that the relationship between 6 and e should now 
be given by the expression e=1 + 4nNB. However, this is not so, 
and that is why the field intensity in the above formula was denoted 
by E prime. The equations relating P and E and p and E’ involve 
different field intensities. E is the force acting on a unit test charge; 
such a charge does not distort the existing field. E’ is the field pro- 
duced by all the molecules (with the exception of the given one) 
acting upon the given molecule, 
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On p. 258, it was indicated that the field inside a dielectric sphere, 
E,, is related to the external field in which this sphere is located as 
follows: 


4 
E,=E,—3 NP. 


It is evident that the field in a spherical cavity which is cut in a di- 
electric may be determined by changing the sign of P: 


E,=E,+4aP. 


It may be rigorously proved that E’, the field due to all the mole- 
cules of a gas (except the one acted upon), is equivalent to the field 
in a spherical cavity. Thus, 


E’ =E+40P. 
The relationship between B and e may now be determined. Equating 
the derived expressions for P, we obtain 


e—l a ’ 
iz E=Np=NBE’. 





e—l 
4m 





Now, substituting B’=E+4 A E, we obtain the socalled 


Clausius-Mossotti formula: 


e—l 4x 
e+2 mea NB. 


If each member of the equation is multiplied by , where M is 





the molecular weight and @ is the density, the resulting expression 
will depend only on the polarisability B. Thus, N = Nay = 6.02 X 


x 10?* (Avogadro’s number). 
The quantity 
e—I1M 4x 
P= 99 =F Nab 

is called the molecular polarisation. To determine the molecular pol- 
arisation, one must first measure the permittivity of a substance as 
the ratio of the capacitance of a condenser filled with the substance 
under investigation to the capacitance of the condenser with the di- 
electric removed. Capacitance is usually measured by means of a 
bridge. Such bridges are constructed for the range 30 cps to 300,000 
cps. However, bridges may be constructed for frequencies up to 
40 mc/s. 
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A wide variety of permittivity meters (instruments for measuring 
®) are employed. Such instruments are very sensitive and enable us 
to measure permittivity with a high degree of accuracy. Indeed, ex- 
cellent results may even be obtained with gases having a pressure of 
the order of 1 mm of Hg. 

Using the formula ¢ =n’ (see p. 328), we can derive an index of re- 
fraction equation which is analogous to the molecular polarisation 
equation: 
nr’—1M 
WZ Q 





4 
R= =F NB. 


This characteristic of a molecule is called molecular refraction. 
Measured values of R and # for different frequencies of electromag- 
netic vibrations may differ considerably from one another. 
Despite the fact that the derivation of these formulas presupposes 
a gas, molecular interaction evidently changes matters little. In 
any case, the formulas for R and # are widely used in the investiga- 
tion of dilute solutions as well. 


Examples. Let us consider benzene, C,H,(e=2.28; 9=0.88gm/cm*; M78), 
and water (e=81; e=1 gm/cm*; M=18). Assume that the plates of a flat con- 
denser, which creates an electric field E=300 v/em=1 Gaussian unit, are im- 
mersed in these liquids. 

1. Let us calculate the polarisation (the electric moment of a unit volume 
of dielectric) of benzene and water: 


-1 2.28—1 . : 
Poenzene= Gr E =a 1=0.1 Gaussian unit; 


P water 6.4 Gaussian units. 


The contribution of each molecule to the polarisation vector is 


p=—, where N= A408 is the number of molecules per unit volume; 


Poenzene=1.5X107?5 Gaussian unit; 
Pwater=19.4x 10728 Gaussian unit. 


2. Let us calculate £’, the intensity of the electric field due to all mole- 
cules (except the one acted upon): 


, 
4 : : 
_Evenzene=E +- ™Ppenzene= 1.43 Gaussian units; 
, 


Ewater= 27.8 Gaussian units, 


i.e., the field in water is about 28 times (!) as great as the applied field. 
Now, the polarisability of molecules of benzene and water can be determined: 


Boenzene = f= 1.05x107**,  Bwater=0.7x 10-77% 
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3. From measurements cf © by means of a permittivity meter, we can 
M 


calculate the molccular polarisation P= are 52 





Poenzene = 26.6 Gaussian units; 
Pwater= 17.3 Gaussian units. 


4. Measurements of the index of refraction n by means of a refractcmeter 
yield Ngenzene= 1.5014 and eager OO Using these values, we can calculate 


the molecular refraction R=——~ 


m~2 po rae 


Poenzene= 26.1 Gaussian units; 
Pwater= 3.6 Gaussian units. 


Jt is seen that in the case of benzene #=R and in the case of water the val- 
ues of # and R differ considerably. The reason for this will be explained in 
the next article. 


260. Polarisation of Polar and Nonpolar Molecules 


Polarisation of a substance under the action of an electric field 
may occur for two reasons. First, thecentre of gravity of the electron- 
ic cloud may be displaced (inherent polarisability). Secondly, the 
field has an orienting action which may turn molecules having a 
constant, or rigid, dipole moment closer to the direction of the field. 
Therefore, it is customary to divide polarisability into two parts: 
a—inherent polarisability and b—orientation polarisability. 

A molecule must be turned as a whole in order for the dipole to 
become oriented. Owing to the inertia of a molecule, such turning 
requires a certain amount of time. For rapid electromagnetic vibra- 
tions, a rigid dipole cannot follow the field. Therefore, in the case 
of light waves, the orientation polarisability 6 is absent. 

Thus, 


GSN e (a +6) and R=EN 4,0. 


The polarisability a of a molecule can be determined by measuring 
the index of refraction. If, in addition, # is also measured, the orien- 
tation polarisability b is obtained by subtraction. 

The magnitude of the orientation polarisability is directly related 
to me Tigid dipole moment p of a molecule. We shall show that 


b= 
sar 
Gas molecules are . randomly oriented as a result of chaotic thermal 
motion. In the absence of a field, the assumption of any direction by 
the dipole moment p of a molecule is equally probable. The situation. 
changes if a field E is applied. The potential energy of a dipole 
is equal to e(p,—@_), where @, and p_ are the potentials of the field 
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at the ends of the dipole, i.e., 
— ej = —pE= — pE cos, 


where 6 is the angle between the field vectors and the dipole moment. 
A dipole oriented in the direction of the field has a minimum.energy. 
This energy is equal to —pE. Thermal motion prevents all dipoles 
from assuming a position of minimum energy. A certain compromise 
distribution is established between the tendency to maximum entropy 
and the tendency to minimum energy (see p. 624). The Boltzmann law 
is an expression of this compromise. The probability that the energy 


of a molecule lies between U and U + dU is proportional to or dU. 


In our case, U=—pE cos 6. Therefore, dU=pE sin 0 d0. The frac- 
tion of molecules the er moments of which lie between § and 


6+ dh will be-.equal to ees oe a Hab. 
For ordinary temperatures, pE< kT. i for extremely strong 


fields of the order of 10° v/cm, the ratio 7 aT will be of the order of 


0.01 (the order of magnitude of dipole moments is 10~** Gaussian 
unit), Therefore, we can use the approximation e*~1 + x, and the 
fraction of molecules sought will be equal to 


const (1 +22 cos 0 ) sin 0 d4. 


The integral of this expression from 0 to m should equal unity from 
the probability viewpoint, since for any molecule the direction of p 
lies somewhere between 0 and a. Then, as can be easily verified, the 
constant is equal to 5 and the fraction of molecules the polarisation 
vectors of which lie in the interval 6 to 6+ d0 will be equal to 


(i fee cos 0) sin 6 dd. 


The projection of the dipole moment on the direction line of the 
field is pcos $. If NM is the number of molecules per unit volume, 
the fraction contributed to the polarisation vector by molecules in- 
clined at an angle 6 to the field will be equal to 


eNe (1 +6 va E cos 0) sin0 cos 6 0. 


The polarisation vector P can be determined by integrating this ex- 
pression from 0 toa. We obtain 


P= =N be Ei 
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hence, the orientation poleieabinty is given by the formula 


b=zen- 


The relationship between molecular polarisation and temperature 
is expressed by the ay 


2 


This theoretical conclusion is in excellent agreement with experi- 
mental results. By measuring # as a function of 7, we can easily de- 
termine the two parameters which describe the electrical properties 
of a molecule, viz., polarisability and the “rigid” dipole moment p. 

Thus, the values obtained for a by measuring R can be used to 
determine p by substituting in the expression for ?. 

Experiments indicate that in certain cases the interaction of di- 
poles of neighbouring particles may result in significant changes in 
permittivity as compared with the value of ¢ for a system of noninter- 
acting molecules. This can be shown by measuring e in a liquid and 
in a gas formed of the same molecules. 

The interaction of particles also affects the permittivity of crystals. 

As arule, electric polarisation in crystals occurs only as the result 
of the deformation cf electronic clouds and the displacement of ions. 
No orientation polarisation occurs since, by and large, molecules can- 
not turn in a crystal. 

In meny ionic crystals, the index of refraction squared is consider- 
ably less than the permittivity. (For example, the values for rock salt 
are 2.37 and 6.3, respectively, for titanium dioxide 7.3 and 114, 
and for lead carbonate 4.34 and 24.) In such crystals, the electronic 
cloud is deformed and, in addition, the ions are displaced as a whole 
under the action of a static field. On the other hand, it has been estab- 
lished that in molecular crystals the permittivity is equal to the square 
of the index of refraction. This indicates that polarisation is due 
exclusively to the deformation of the electronic cloud. 

Since orientation polarisation is absent, permittivity varies very 
little as the temperature changes. 

It has already been indicated that in the case of a rapidly varying 
field there is no orientation polarisation and the molecular polarisa- 
tion becomes equal to the refraction. It is important to know which 
field oscillations should be considered rapid. This can be determined 
if the relaxation time is known. When the relaxation time t is 
much greater than the oscillation period, there is no orientation 
polarisation. 

The relaxation time t was discussed on p. 163. If a dielectric is in 
a constant field, its dipoles assume at equilibrium orientation dis- 
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tribution which depends on the temperature. When the field is 
switched off, the dipoles become disoriented. However, this does not 
occur instantaneously, i.e., the order decreases in accordance with an 
exponential law. The rate of this decrease is described by the relaxa- 


tion time t—the time in which the polarisation decreases to + of its 


original value. If t is much greater than the oscillation period, the 
direction of the external field changes before the dipoles change their 
orientation. A very rapidly varying field does not affect the behaviour 
of the dipoles at all. If t<T, at each instant the state will be in equi- 
librium and the polarisation will closely follow changes in the field. 
For most dielectrics, the relaxation time is of the order of 10~**- 
10-** sec. 

Examples. 1. Using the results of the example on p. 651, let us determine 
the values of the inherent polarisability a and the orientation polarisability 6 
for benzene and water. 

a=8R/40N gy, whence Apenzene= 107% Gaussian unit; 
Ayater= 0.14 107** Gaussian unit. 
On the other hand, 
a+b=3P/4aN ay, (A+ 6)yenzene= 10~7% Gaussian unit; 
(A+b)waterp== 0.7 x 10728 Gaussian unit. 

It follows that 

bponzene=9 and Dygtep=0.7x 107% —0, 14x 10778 = 0,56 x 107*8 Gaussian unit. 


This means that a benzene molecule does not have a rigid dipole moment, but 
a water molecule does. 

2. Let us determine the rigid dipole moment of a water molecule from the 
formula p= Y 3k70. If the molecular polarisation @ and the molecular refraction 
R are measured at room temperature (T=300°K), 


p=V 3x1.38x 107 x300x0.5x 10-8 =0.8x107* Gaussian unit, 


This value is in close agreement with experimental results. 

Frequently, the unit 1 debye=107!* Gaussian unit is used as a measure of 
dipole moment. This unit is named after the German scientist Debye, who 
developed the theory of dipole moments. 





261. Additivity of Molecular Refraction 


The refraction R is a molecular constant. R does not depend on 
the density or phase of asubstance (this has been demonstrated exper- 
imentally), nor on the temperature of the given substance. A conve- 
nient property of refraction is its additivity. If it is possible to compile 
a table of property increments* for all possible atoms, and if the 


* These are the contributions of a given atom to the value of a given phys- 
ical quantity. 
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magnitude of the property is determined as the sum of the increments, 
such a property is said to be additive. The additivity of R can be 
used for analytical and identification purposes. (It should be noted 
that this additivity possesses no theoretical basis and in a number of 
cases there occur significant deviations from the ideal.) . 

Innumerable observations have been processed by numerous inves- 
tigators, and tables of R increments have been compiled. (Most of 
these tables are for R, increments, i.e., measurements of the index 
of refraction for the so-called D-line, the yellow line of sodium.) 
For example, for C, H and Cl atoms, the increments are equal to 
2.418, 1.100 and 5.967, respectively. By means of these values ‘alone, 
one can predict the molar refraction of many compounds: 


methane, CH,: R=2.418+ 4 1.100; 
chloroform, CHCl,: R = 2.418 --1.100+3 5.967; 
carbon tetrachloride, CCl: R==2.418-+ 4 5.967, 


etc. Refractions can be measured with great accuracy and when neces- 
sary extremely small differences can be determined. 

In view of dispersion anomalies, which, as was explained earlier, 
occur at frequencies close to the nat- 
ural frequencies of absorption, refrac- 
tion should be measured in a region 
far removed from the absorption 
bands. 

Indexes of refraction are measured. 
by means of refractometers. , Most re- 
fractometers measure the angle of re- 
fraction of a beam of light (emerging 
from a material under investigation) 
which impinges on the surface of a 
prism made of glass with a’ high- 
er fn. 

If a bundle of rays with an angle of 
incidence of 0° to 90° reaches the 
boundary between the material under 
investigation and the glass, the re- 
fracted rays will lie between 0° and 
a certain critical angle a the sine of 
which will be equal to the ratio of 
the index of refraction of the mate- 
rial under investigation to that of 
the glass of the prism (see Fig,:286). 
The critical angle is indicated by a 

- .-.,.>77" . «sharp linein the focal-plane of the 
Fig. 286 tube. 
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To measure the index of refraction of a liquid, we must place a 
0.5-mm. layer of iton the surface of the prism. When measuring sol- 
ids, the material must make close contact with the surface of the 
prism. Optical contact is achieved by using drops of an appropriate 
liquid between the surface of the prism and that of the material 
under investigation. The index of refraction of a powder may be 
determined by immersing the powder in a liquid whose index of 
refraction is the same as that of the powder. 


262. Pyroelectric and Piezoelectric Materials 


A crystal which does not have a centre of inversion included in 
its symmetry elements may possess a number of interesting proper- 
ties. Such crystals may have an electric moment (polarisation vector) 
in the absence of an external field. 

First, let us direct our attention to crystals which become polarised 
with homogeneous deformation.* This property is characteristic of 
piezoelectric crystals, which were discussed 
in Sec. 45. 

The occurrence of polarisation on com- 
pression, extension, etc., shows that homo- 
geneous deformation results in the creation 
of a special, i.e., single (not multiplied 
by the number of symmetry elements) di- 
rection. Such behaviour is not possible 
when a crystal has a centre of inversion. 
Homogeneous deformation cannot elimi- 
nate a crystal’s centre of inversion. At the 
same time, the existence of a centre of. 
inversion is incompatible with the exist- 
ence of a special direction such as the 
polarisation vector direction. Any crystal 
that does not have a centre of symmetry 
may possess piezoelectric properties. Never- Fig. 287 
theless such properties are not found in 
many crystals of this type. This may be due to the fact that 
instruments are not sufficiently sensitive. However, we may con- 
ceive of a noncentral symmetric structure in which a homogene- 
ous deformation does not displace the centre of gravity of posi- 
tive charge relative to the centre of gravity of negative charges 
Close examination shows that the piezoelectric effect is not possible 
in one of the noncentral symmetric groups of symmetry. 





* This is a deformation in which all volume elements are deformed in the 
same manner. 


22—3580 
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The most common piezoelectric material is quartz. Fig. 287 
oe how piezoelectric plates may be cut from a quartz crys- 
tal. 

The nature of atom displacements can be assessed from Fig. 288. 
The structure of quartz may be pictured as a compact packing of 
oxygen ions in the empty spaces of which silicon atoms are located. 
(In the figure, silicon atoms are represented as black spheres and oxy- 
gen atoms as white spheres.) The oxygen ions carry a negative charge 
and the silicon ions a positive charge. A silicon atom is surrounded 
by four oxygen atoms. Electrisation of quartz occurs when it is com- 
pressed along the polar axes. Pressure applied along axes lying in the 
plane of the figure results in the displacement 
of its positive charge relative to its negative 
charge. Pressure along an axis of the third 
order (a nonpolar direction perpendicular to 
the plane of the figure) is ineffective. 

Atom displacements cannot be shown in the 
figure. These displacements are quite negli- 
gible and cannot be detected by objective 
is methods (e.g., X-ray structural analysis). 
Fig. 288 The piezoelectric constant of quartz, i.e., the 
magnitude of the polarisation vector for unit 
pressure, is equal 10 6.5 x 107* Gaussian unit. The volume of a unit 
cell of quartz is equal to 112 x 10-** cm*. Since there are three SiO, 
molecitles in a cell, the number of molecules in a unit volume is 
equal to 2.7 x 10° and, therefore, the dipole moment per molecule 
for unit pressure is equal to 2.4 x 10-*° Gaussian unit. The charge 
of a molecule is equal to 14 + 2 x 8=380 electron units. Therefore, 
the displacement of the centre of gravity of positive charge relative 
to the centre of gravity of negative charge is a negligible quantity 
-of the order of 107-"* cm. 

Piezoelectric crystals include a class of materials known as pyro- 
electric crystals. Such crystals are naturally polarised under normal 
temperature and pressure. Usually this effect is masked by the free 
surface charge which accumulates along the boundaries of the crys- 
tal, but it may be detected when the temperature of the crystal is 
raised. Hence the designation pyroelectric (pyro means fire). 

Pyroelectric crystals have even more restricted symmetry. Only 
a crystal having a special axis can be termed pyroelectric. Thus, the 
mere absence of a centre of symmetry is insufficient. The significance 
of this condition is evident. The presence of natural polarisation 
indicates that such a special direction is present in pyroelectric crys- 
tals, while in the case of other piezoelectric crystal such a direction 
appears only under the action of mechanical deformation. One of 
the most common pyroelectric substances is tourmaline. 
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A pyroelectric crystal has a very strong internal electric field. 
Therefore, the superposition of an external field does not change the 
polarisation of such a crystal, i.e., the polarisation cannot be increased, 
decreased or rotated. Such a crystal is polarised to saturation— 
all particle dipole moments are parallel. In the theory of ferromagnet- 
ism (see Sec. 266, which may be read with profit at this point), 
a region in which the magnetic moments of atoms are parallel is 
called a domain. The same term is applied to a region in which the 
electric dipole moments of all particles are 
parallel. A pyroelectric crystal usually 
constitutes a single domain, but there are 
exceptions. A number of substances may 
yield multidomain crystals. Such pyro- 
electric substances are known as Seignette 
electric, or ferroelectric, substances. The 
former designation is derived from Seig- 
nette salt, in which B. V. Kurchatov and 
P. P. Kobeko first detected this effect. The 
latter designation emphasises the close sim- fa 08 Ort 
ilarity between these substances and fer- Fig. 289 
romagnetic ones. Both ferromagnetic and 
ferroelectric substances have very large dielectric constants (values of 
several hundred or even thousand), pronounced hysteresis effects, and 
Curie points. The discussion of Sec. 266 is completely applicable here 
and will not be repeated. The points made about the eflect of a field, 
polarisation by the displacement of domain boundaries, and the 
reasons for the division of a crystal into small domains all apply to 
ferroelectric materials as well. 

A large number of ionic pyroelectric crystals are ferroelectric. 
Several of these are particularly suitable to demonstrate ferroelectric 
properties. Barium titanate, BaTiO,, is typical in this respect. 
At 120°C, barium titanate loses its special properties and becomes an 
ordinary dielectric. At temperatures above 120°C, this substance has 
the simple unit cell shown in Fig. 289. The cell is cubic; at the centre 
there is a titanium atom, at the corners of the cube barium atoms, and 
at the face centres oxygen atoms. The cell has a central symmetric 
structure; above 120°, thecrystals no longer exhibit pyroelectric prop- 
erties. When the temperature is reduced a phase transition occurs 
and the structure changes: one of the cube edges becomes | per cent 
longer than the other two and the cube is transformed into a tetrahed- 
ron. In this process, the titanium atom is displaced in the direction 
of one of the oxygen atoms. This now becomes the special direction 
and the polarisation vector will be parallel to this line. A barium 
titanate crystal has three directions of weak polarisation, rather than 
one, since the displacements along the three axes of the cube are equal. 





22* 
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When the substance is cooled below the Curie point (120°C for a 
barium titanate crystal) different regions of the crystal may be trans: 
formed into domains with different orientations. A crystal which has 
acquired a domain structure is in a state of mechanical stress, i. e., 
some portions of the crystal are compressed and others extended. 
Strictly speaking, a domain crystal is not amonocrystal—three-di- 
mensional long-range order throughout the crystal is no longer present. 

On further reduction of temperature, barium titanate undergoes 
yet another phase transformation at about +10°, but it does not cease 
to be ferroelectric. 

Seignette salt behaves differently. It possesses ferroelectric proper- 
ties only within a narrower temperature interval—from —20°C to 
+24°C, 


CHAPTER XXXVI 
MAGNETIC SUBSTANCES 


263. Three Groups of Magnetic Substances 


Substances may be classed into three groups in accordance with 
their magnetic properties: diamagnetic, paramagnetic and ferromag- 
netic. The values of diamagnetic susceptibility lie in the range of 
--13 x 10~* (bismuth) to—0.8 x 107° (copper). Paramagnetic bodies 
are characterised by positive susceptibility—for example, 0.4 x 
Xx 107° (potassium) and 320 x 10-° (iron chloride). Ferromagnetic 
bodies are characterised by large values of permeability. These 
are hundreds and even thousands of times greater than those of other 
bodies. Let us examine the structural features which explain these 
great differences in magnetic properties for substances which otherwise 
do not show great differences in properties. 

Diamagnetism, it will soon be seen, is a universal property of all 
bodies inasmuch as they consist of electrons. The above values show 
that diamagnetic properties are weaker than paramagnetic ones and, 
a fortiori, weaker than ferromagnetic properties. Diamagnetic proper- 
ties may be detected only in the absence of properties resulting in 
positive magnetism. Paramagnetic and ferromagnetic bodies have 
diamagnetic properties, but they are obscured by the stronger positive 
paramagnetism. Thus, diamagnetism exists for any system containing 
electrons. On the other hand, positive magnetism arises only in bodies 
the atoms of which possess a magnetic moment. The phenomenon of 
paramagnetism is very similar to the process of electrisation of a 
dielectric, which consists of rigid dipoles possessing aconstant dipole 
moment. 

The presence of a magnetic moment in atoms is also a necessary con- 
dition for the existence of ferromagnetic properties. However, the 
peculiarities of ferromagnetic substances are due to a very specific 
property, viz., the formation within a body of vast regions—domains 
—within which the magnetic moments of thousands of millions of 
atoms are arranged parallel to one another. 


264. Diamagnetism 


Diamagnetism is a direct consequence of the tendency for an elec- 
tron to move in a circle in a magnetic field. 
In a magnetic field with an induction B, an unbound charged 


particle moves in a circle with an angular frequency one, It can 
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be rigorously proved that the action of a magnetic field on an elec- 
tron moving in acentral field—in particular, in the field of an atomic 
nucleus—produces an analogous effect: the electron will move in a 
circle about a line of force, but at one-half the frequency, viz. 
This motion is superimposed on other motions which may be per- 
formed by the electron: the chaotic motion of particles of the electron 
gas or the motion of the electron about an atomic nucleus. 

The fundamental considerations discussed on p. 492 showed that 
such motion may be equated to a circular electric current. When the 
magnetic field is switched on, the electrons begin to rotate about the 
magnetic field and each produces an ele- 
mentary current 

ve eo 
are 


Multiplying this value by the area of the 

circle described by an electron in its mo- 

tion about a line of force, we obtain the 

value of the diamagnetic moment created 
by one electron: 

1 ew e 
The reason for the minus sign is clear 
from Fig. 290: the direction of the mo- 
Fig. 290 ment is opposite to that of the field. 

When a system consists of a large 

number of electrons, we must take the summation of the above 
expression with respect to all the electrons: 


M=—zo3 OSB. 


Since by definition (see p. 287) magnetic susceptibility is equal to the 
ratio of magnetic moment per unit volume (or unit mass or mole) 


to induction, 
Ne xy 
X= ~ amet XS. 


If N is Avogadro’s number, y represents molar diamagnetic sus- 
ceptibility (in comparing with the results on p. 287, note that y= =) ‘ 
Thus, x is given by the areas circumscribed by electrons in their 
secondary motion in themagnetic field. In principle, this computation 
can be made if we know the wave function of the system, i.e., in 
the final analysis, the electron density. Actually, since the computa- 
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tion is very cumbersome, the diamagnetic susceptibility is determined 
experimentally. 

It should be emphasised that diamagnetic susceptibility is deter- 
mined by the electron structure of the system and does not depend 
(at least for atoms and molecules) on external conditions, including 
temperature. 

Diamagnetic susceptibility, like molecular refraction, possesses 
additivity. If the diamagnetic susceptibility is taken for a mole of 
substance, the susceptibility y of a molecule may be expressed with 
considerable accuracy as 


x => Naka 


where n, is the number of atoms of type A in the molecule and y, 
is the increment for the given atom. For purposes of illustration, we 
can use the same example as for refraction (see p. 656). C,H and Cl 
atoms have the increments 7.4, 2.0 and 18.5 (y,X 10°), respectively. 
Thus, we obtain 15.4 for methane, 64.9 for chloroform, and 81.4 for 
carbon tetrachloride. These values are in close agreement with experi- 
mental results. 

The significance of this additivity consists probably in the follow- 
ing: outer electrons weakly affect diamagnetic susceptibility, In 
so far ‘as additivity is realised, diamagnetic susceptibility is an 
atomic rather than a molecular property. 

Diamagnetic susceptibility, as indicated in the preceding article, 
is a property associated with substances the atoms and molecules 
of which do not have a constant magnetic moment. Such particles 
include in the first place atoms and ions with completed shells— 
the ions F~, Cl~ and Nat and atoms of the noble gases. Atoms and 
ions which in addition to a completed shell contain two more s-elec- 
trons with anti-parallel spins, e.g., Zn, Be, Ca and Pbt*, are also 
diamagnetic. 

The group of diamagnetic molecules is incomparably larger than 
the group of paramagnetic molecules. The latter exists more in the 
nature of exceptions. This is due to the fact that practically all mole- 
cules have valent bonds formed by a pair of electrons with anti-par- 
allel spins. Usually, the total moment about a nucleus, as well 
as the spin moment, equals zero in such molecules. Thus, bodies 
consisting of atoms and ions such as those cited above and practical- 
ly all bodies the building blocks of which are molecules—thereforey 
practically all organic substances—are diamagnetic. 

Diamagnetic susceptibility describes the electron cloud of a mole- 
cule. If the distribution of electrons in a molecule is strongly anisotrop- 
ic, its magnetic susceptibility is also anisotropic. The anisotropy of 
diamagnetic susceptibility is manifested particularly in molecules of 
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the aromatic compounds. For example, in benzene, y,, the molar dia- 
magnetic susceptibility in a direction lying in the planeof a benzene 
ring, equals —37 x 10~* cm*/mole and y, the molar diamagnetic sus- 
ceptibility in a direction perpendicular to the plane of a ring, equals 
—91 xX 107° cm*/mole; in naphthalene yy=—40 x 107-*° cm*/mole 
and x; =—190 x 10~° cm*/mole. Anisotropy may be detected by 
measuring crystals oriented in different directions in the field. Meas- 
urements of powders, liquids and gases yield a value of magnetic 
susceptibility for an averaged orientation. 


265. Paramagnetism 


A substance has paramagnetic properties if the atoms, ions or mole- 
cules of which it consists possess a magnetic moment. A magnetic 
moment is due either to the uncompensated spins of electrons in the 
atomic system or to the motion of electrons about nuclei, or both. 

As was explained earlier (see p. 496), a magnetic moment resulting 
from spin is related to angular momentum as follows: 


1, = 20, VEGF), 


and a magnetic moment resulting from the motion of electrons about 
a nucleus is related to angular momentum as follows: 


Mp =baVL(L +1). 


Here, 1, is the Bohr magneton and s and L are, respectively, the total 
spin momentum and the total angular momentum for motion about 
a nucleus, taken for an atom or molecule as a whole. As previously, 


sand L are expressed in units of A When paramagnetism is due to 


both effects, the formula for the magnetic moment of an atom or mol- 
ecule takes the form 


w= gu,VI(SF)), 


where J/ is the quantum number of the total quantum momentum, i.e., 
the vector sum of L and s, and gis the Lande factor, which depends on 
all three quantum numbers. Incidentally, the proximity of g to | or 2 
(established experimentally) is an excellent indicator of the origin 
of the magnetism of a given substance. 

Paramagnetic atoms and ions include particles having one electron 
over and above a completed shell (e.g., atoms of the alkaline metals), 
atoms of the transition elements, ions of the rare earth elements 
with incomplete shells, etc. 

Most molecules, as already indicated, are diamagnetic. Molecules 
of oxygen and sulphur, which are paramagnetic, are exceptions and 
have a total spin equal to 1. The magnetic moment obtained experi- 
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mentally is in close agreement with the value calculated by means 
of the formula p=2,V 2. 


The presence of paramagnetism is proof of the fact that the mole- 
cules contain unpaired electrons. Thiscircumstance makes the measure- 
ment of the magnetic properties of molecules of great interest to the 
chemist. The so-called free radicals, which are chemical compounds 
with an unpaired electron, possess paramagnetic properties. Free 
radicals are created in a number of instances in chemical reactions, 
and the measurement of magnetic susceptibility is a possible method 
of studying the course of chemical reactions. 

How is the value of the paramagnetic moment of a molecule relat- 
ed to that of magnetic susceptibility? In paramagnetic bodies located 
outside a magnetic field, the magnetic moments are distributed ran- 
domly with respect to direction, and the total magnetic moment of a 
substance is equal to zero. When a field is switched on, the atoms 
(or molecules) will tend to rotate in such a way that their magnetic 
moment coincides with the direction of the field. As a result, equilib- 
rium is established between two tendencies: the ordering action of the 
field and the tendency to thermal randomness. The reasoning used on 
p. 653 to derive the value of the polarisability of a substance consist- 
ing of rigid electric dipoles is completely applicable here. Therefore, 
like in that case, the relationship between the magnetic moment of an 
atom (or molecule) and the paramagnetic susceptibility of an atom 


is given by the expression 
2 


Xatom = Ay . 


In contradistinction to diamagnetic susceptibility, the paramagnet- 
ism of a substance depends on temperature. To be sure, the situation 
here is somewhat more complex than in the case of dielectrics. This 
is due to the fact that the electric moment of a molecule is a constant, 
while the magnetic moment of a molecule (or atom) may vary consid- 
erably with the temperature. Paramagnetic moment is related 
to quantum numbers, and the distribution of molecules according to 
state may depend greatly on temperature. Therefore, the simple law 
that magnetic susceptibility is inversely proportional to temperature 
(the Curie Law) may not be valid in the case of paramagnetic sub 
stances, 


266. Ferromagnetism 


Domain. A small number of substances possess marked (using coarse 
observation methods) magnetic properties. These substances in- 
clude iron, cobalt, nickel, gadolinium, compounds of these elements, 
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and certain compounds of manganese and chromium. Since iron is 
ie most important of these, such substances are said to be ferromag- 
netic. 

Atoms of a ferromagnetic substance have amagnetic moment, which, 
moreover, is caused by spin (at least, basically). However, it is not 
this feature that distinguishes it from a paramagnetic substance. 
The main characteristic of a ferromagnetic substance is its domain 
structure. A domain is a region 
which is magnetised to saturation, 
2 i.e., a region in which all atoms 
= are arranged with their magnetic 
; moments parallel. Since the linear 
dimensions of domains are usually 
of the order of 0.01 mm, they may 
be observed by means of an ordi- 
nary microscope. 

Domains exist in a ferromagnetic 
substance in the presence as well 
as in the absence of a field. In or- 
der to observe domains, we place a 
drop of colloidal suspension—a 
finely divided substance such as 
magnetite (Fe,O,)—on the polished 
surface of a ferromagnetic monocrystal. Colloidal particles become 
concentrated close to the boundaries of the domains since strong local 
magnetic fields exist along such boundaries (as in the case of any bar 
magnet) and they attract the grains of magnetite (see Fig. 291). 

First, let us consider certain problems arising in connection with 
one domain; then we shall study the arrangement of domains in crys- 
tals; and finally we shall examine the process of magnetisation of a 
ferromagnetic substance. 

The orientations of the magnetic moments of atoms forming a single 
domain are not arbitrary. Every crystal of a ferromagnetic substance 
has a particular crystallographic direction along which it is most easi- 
ly magnetised. In hexagonal cobalt this is a single direction—the hex- 
agonal axis. In cubic iron this direction is the edge of a cube. This 
means that there are three directions of easiest magnetisation and 
accordingly three directions of magnetic moments of domains. 
In cubic nickel the spatial diagonals of a cube are axes of easiest 
magnetisation, i.e., there are four possible directions of magnetic 
moment. 

Why is it that atoms in a ferromagnetic substance arrange them- 
selves so that their magnetic moments are parallel? This is caused by 
a specific phenomenon—the interchange of positions by electrons. 
As indicated in connection with a chemical bond, the overlapping of 
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wave functions results in a decrease in energy. Electrons occupy 
a common space and are able to interchange positions. The tendency 
of exchange energy to become minimal is the reason for the stability 
of most chemical compounds. Exchange energy plays an analogous 
role in the creation of a domain. In the case of a chemical bond, the 
minimum value of exchange energy is achieved when the spins of 
interchanging electrons are anti-parallel. However, the general con- 
clusion of quantum mechanics is broader, i.e., the exchange energy 
may in certain cases be minimal for parallel orientation of spin and 
in other cases for anti-parallel orientation of spin. In ferromagnetic 
substances, the spins of atoms contained in a domain have a parallel 
orientation. Comparatively recently, a new class of compounds— 
anti-ferromagnetic substances — was discovered. In these sub- 
stances, stable domain states occur for anti-parallel orientation of 
spin. 

From measured values of the magnetisation of a domain, one can 
calculate the number of spins per atom involved in ferromagnetism. 
Such numbers are not whole numbers (for iron 2.2, for cobalt 1.7, 
for gadolinium 7.1, etc.). It must be concluded that to a certain ex- 
tent the electrons forming an electron gas are also involved in the 
creation of ferromagnetism. However, in the main, electrons bound to 
atoms are responsible for ferromagnetism. In iron, conduction elec- 
trons come from the outer 4s shell, while ferromagnetic electrons are in 
the 3d shell. 

The existence of renarkable materials known as ferrites constitutes 
direct proof of the absence of any connection between conduction 
properties and ferromagnetism. These materials are semiconductors 
with a specific resistance of 10 to 11 orders of magnitude greater than 
iron. Conduction electrons, of course, play no role in the magnetism of 
these substances. Ferrites are mixed compounds; for example, man- 
ganese ferrite is a 1 : 1 mixture of manganese oxide and iron oxide, 
and nickel ferrite is an analogous mixture of nickel oxide and iron 
oxide. Iron oxide contains two iron atoms, and nickel oxide one nick- 
el atom. A crystal of the mixture represents a compact packing of 
oxygen atoms. The nickel atoms and the two iron atoms fit into the 
empty spaces. It was indicated on p. 596 that there are two kinds of 
empty spaces in a compact packing arrangement, viz., tetrahedral 
and octahedral. An atom which fits into an empty space of the first 
kind is surrounded by four neighbours, while an atom in an octahet- 
ral space has six neighbours. The iron atoms fit into both kinds of 
spaces. The magnetic moments of the iron atoms are quite ordered, 
but the moments of iron atoms in tetrahedral spaces point in one di- 
rection while the moments of iron atoms in octahedral spaces point 
oppositely. As a result, the actions of these two systems of moments 
cancel each other and the magnetic properties of such a mixed oxide 
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result from the magnetism of nickel, the moments of whose atoms are 
all pointed in one direction. 

The presence of exchange energy explains the tendency of atoms 
to arrange themselves so that their spins are paratlel or anti-parallel. 
Apparently, in ferromagnetic 
substances, the exchange ener- 
gy of interaction becomes of 
prime importance and causes 
the substance to have a spin 
arrangement such that the 
energy assumes a minimum 
value. In the remaining par- 
amagnetic substances, other 
components of interaction 
energy do not allow the ex- 
change energy to make itself 
felt. 

The long-range order of 
atoms is destroyed at a cer- 
tain temperature: the crystal 
becomes fused. Temperature 
affects the arrangement of the 
magnetic moments in exactly 

Fig. 292 the same manner. Fig. 292 
shows schematically how the 
magnetic moments of the atoms behave when the temperature is 
raised. At first the vibrations are in phase, then disorder begins to 
prevail, and finally the magnetic order “melts away”. Beginning 
at a definite temperature called the Curie point, in honour of the 
outstanding French scientist Pierre Curie, the order in the arrange- 
ment of arrows disappears and the substance loses its magnetic 
properties, i.e., the ferromagnetic substance turns into a paramag- 
netic substance. For iron the Curie point lies at 770°C, for cobalt 
at 1,115°C, for nickel at 358°C and for gadolinium at 15°C. 

In an anti-ferromagnetic substance, the spin of atoms tends to 
assume an orderly, but anti-parallel, arrangement. The structure 
of a domain of manganese oxide, which is an anti-ferromagnetic sub- 
stance, is shown in Fig. 293. Arrows represent the moment of manga- - 
nese. From the figure, we see that the chemical period of structural rep- 
etition is one-half of the magnetic period. At absolute zero each atom- 
ic magnet of the anti-ferromagnetic substance is surrounded by atoms 
with oppositely directed moments. As in the case of a ferromagnetic 
substance, this order is destroyed at a definite Curie temperature 
oe above this critical point it behaves like a paramagnetic sub- 
stance. 
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The existence of various anomalies in the behaviour of a body in 
passing through the Curie point is indirect evidence of the existence 
of anti-ferromagnetic properties. Since the Curie point is a point of 
phase transition of the second kind, a number of properties undergo 
an abrupt change in passing through it. 

Direct evidence of the existence of anti-ferromagnetic properties 
has been obtained by means of neutron diffraction methods. The 
scattering of neutrons by a lattice (see Fig. 293) is sensitive to the 





Chemical period 
Magnetic period 
Fig. 293 





chemical period, rather than to the magnetic period, of structural 
repetition. 

Domain Structure of a Crystal. In examining the domain structure 
of a ferromagnetic monocrystal by the powder method, which was 
described earlier, we note that a domain is never very large, i.e., 
its linear dimensions are usually no greater than0.01 mm. It is found, 
moreover, that cubic ferromagnetic substances have extraordinar- 
ily symmetric combinations of differently oriented domains. These 
two circumstances require explanation since, it would seem, that 
thanks to the ease of magnetisation an entire crystal should be trans- 
formed into a single domain. 

L. D. Landau and E. M. Lifshits have shown that a domain strut- 
ture of the kind shown in Fig. 291 is a natural consequence of the 
existence of different energy forms in a ferromagnetic body. The 


essence of the theory is illustrated in Fig. 294. The first diagram corre- 


1 


sponds to a single domain, the magnetic energy of whichis <— \ H?dt. 


But the energy corresponding to the second configuration is 
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only one-half of this value. In the case of N parallel domains, the 
energy will be about v of that for a single domain. However, this 
dividing process will be advantageous only up to a certain point. 


Beyond that point, the energy of boundary layers exceeds the decrease 
in energy associated 


a with the division of a 
NNN NN 358 §WYYy crystal into domains. 


The advantage of con- 
figurations which consist 
of domains forming 
closed circuits is evident. 
In such cases, a closed 
magnetic flux circuit is 
formed and the energy 
of the field outside the 
crystal equals zero. 

In the case of cobalt, 
which has a magnetisa- 
tion direction along its 
axis, we encounter do- 
mains in which the mo- 
ments are oriented only 
along the axis of a hexa- 
gon. The zero magnetic 
moment of a body in the 

Fig. 294 absence of an external 

field is realised as fol- 

lows: half of the domains have one orientation and the other half 
the opposite orientation. 

A few words regarding the boundary between domains. This boun- 
dary layer is shown schematically in Fig. 295. We see that in this 
layer the magnetic moments gradually change direction. The thickness 
of the layer is determined by the requirement for minimum energy. 
Two opposite tendencies occur here. On the one hand, it is desirable 
to extend over a thick layer—this being of greater advantage with 
respect to exchange energy—the disadvantageous process of spin 
turning. On the other hand, it is better to complete this process rapid- 
ly since in the transition layer the spins are at an angle to the 
directions of easiest magnetisation. 

Now, let us consider what happens in a ferromagnetic substance 
when an external field is switched on. Themagnetisation process 
may be followed by the powder technique. It transpires that the basic 
mechanism of magnetisation consists in the growth of a domain, which 
points in the “required” direction, by means of boundary displacement. 
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The domains which form an acute angle with the field “swallow up” 
those which form an obtuse angle with the field. At the beginning 
of magnetisation, domain boundary displacement is reversible, i.e., 
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Fig. 295 


when the field is switched off the initial boundaries of the domains are’ 
restored. Later on, domain boundary displacement becomes irrevers- 
ible. Finally, when a very high degree of magnetisation is reached, 
the direction of magnetisation of the domains 
begins to turn. This is illustrated in Fig. 
296. 

In polycrystalline substances, the situation 
is exactly the same (assuming the crystal 
particles are not extremely small since domains 
are not formed when the size of the particles 
are less than 10~° cm), i.e., each grain may 
consist of several domains. In so far as the 
crystallographic axes of the particles have 
random orientations. in the body, however, 
the magnetic moments of the domains orient 
themselves randomly. Thus, the simple mag- 
netisation diagrams which have come down to 
us from the days of Ampére provide a correct 
picture of polycrystalline substances. 

Hysteresis effects, which are characteristic 
of all ferromagnetic materials, are due to the 
irreversible nature of domain boundary dis- 
placement during magnetisation. Fig. 296 





CHAPTER XXXVII 


EFFECT OF ELECTRON STRUCTURE ON PROPERTIES 
OF BODIES 


267. Free Electrons 


Until now, in discussing the structure of solids and liquids, we did 
not pay particular attention to the role of electrons in the formation 
of the properties and structure of these bodies. We were able to do 
this because the electron structure of bodies is by no means always of 
prime importance. In a number of cases, however, the role of elec- 
trons must be taken into account. There are two “kinds” of electrons in 
a body, viz., bound electrons and unbound (free) electrons. Bound 
electrons are component parts of a specific atom, ion or molecule. 
Unbound electrons belong to the entire crystal or liquid and may move 
quite freely between atoms. 

In molecular substances, the picture of electron structure is 
particularly clear. In most cases, there are no common electrons, 
i.e., none of the electrons leaves the “bounds” of a molecule. In 
ionic crystals, the restriction of electrons is not quite so clear. Even 
according to the classical view of an ionic bond, it cannot be as- 
sumed that electron exchange is completely absent. Nevertheless, elec- 
trons passing from ion to ion (exchange electrons) in ionic crystals do 
not behave like free electrons; their displacement in such a crystal 
consists in the transfer of an electron from one atom to its neighbour. 
This is quite clear in crystals having a homopolar bond. Diamond 
is an insulator, although the electrons binding the carbon atoms are 
by no means restricted to specific positions, but are relayed from atom 
to atom. 

Metals differ quite considerably from all of the bodies mentioned 
above. Here, we encounter electrons for which the term “free” is 
entirely justified. Electrons are displaced in a metal just like gas 
particles in a tube filled with obstructions. Atomic products (ions) 
in a state of thermal vibration act as obstructions. The presence of 
free electrons is revealed primarily in conduction phenomena as well 
as all experiments involving the escape of electrons from a body. 
This type of phenomenon could not be explained without considering 
the peculiar behaviour of common electrons. 

It would be incorrect, of course, to assume that the division of 
electrons into bound and free electrons is absolute. This may rather 
be considered an idealised division. In solids we may encounter elec- 
trons which are bound in various degrees. This became particularly 
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evident when physics assigned a rightful place to semiconductors, 
which occupy an intermediate position between a system of ideally 
free electrons and a system of exchange electrons, or a system of 
electrons bound to molecules. It is now known that any transi- 
tional type of structure is possible. 

It should be noted that an electron in a solid, just like an atomic 
electron, obeys the laws of wave mechanics. The representation of an 
electron as a spherule has validity within the limits imposed by the 
principle of uncertainty. Usually it is physically meaningless to 
speak of the path of an electron inside a metal. A description of the 
electron structure of a body consists primarily in indicating how 
its electrons are distributed according to energy. 

Theory shows that the electrons of a body may be represented as an 
electron gas, but this statement must be properly qualified. It tran- 
spired that it is possible to picture the electrons of a metal as a gas 
of fictitious particles, the effective mass of such a particle being de- 
pendent on its direction of motion. This point is made in order to 
caution the reader against making a superficial analogy between an 
electron gas and a gas consisting of molecules. 


268. Energy Levels in a Solid 


The energy levels of a free atom were discussed earlier. Such 
energy levels may be determined experimentally, i.e., by observing 
the energy transitions which occur when light is emitted or absorbed. 
When an atom possesses many electrons, a unique group of four 
quantum numbers is associated with each electron; according to the 
Pauli exclusion principle only one electron may exist in a given quan- 
tum state. Therefore, energy levels have a limited capacity. The s 
levels of an atom may contain two electrons, the p levels six, etc. 
This information may be determined experimentally and as a conse- 
quence of the fundamental laws of quantum mechanics. 

To determine the energy levels of a system consisting of a large 
number of atoms, we should use both approaches here as well. The 
fundamental theoretical concepts remain unchanged in the case of a 
system consisting of thousands of millions of atoms. Therefore, it 
may be concluded that the number of quantum states in a system con- 
sisting of n atoms will be n times the number in a free atom. Thus, 
the Pauli exclusion principle can be satisfied: only one electron wilh 
exist in a given quantum state. 

An atom never completely loses its identity in a given body. Spec- 
tral investigations indicate that significant changes affect only outer, 
valence electrons, which are responsible for interaction between atoms.- 
Therefore, the quantum states of a solid must be closely related to the 
quantum states of an atom. Let us consider, for example, the elec- 
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trons of a K shell, which is closest to the nucleus of an atom. It is 
evident that the state of such an electron can change only very 
little when atoms are combined in a body. Nevertheless, the Pauli 
exclusion principle does not allow us to consider all K electrons as 
equal. It becomes necessary to assume that n extremely close K levels, 
each of which consists of a pair of electrons of opposite spin, exist 
in a body consisting of n atoms. 

This reasoning also carries over to the other energy levels. It is 
assumed that the quantum states of a body are related to that of an 
atom by the following rule: a body consisting 








3p of n atoms has n times as many energy 
38 levels as an individual atom. Each level of 
a free atom yields n close levels in a solid 
body. This means that the energy levels 
of a body may be viewed as a system of 
bands. Each band is a split level of an 
ae atom. Therefore, the same designations may 
es be used for the band as in atomic spectroscopy: 
ls, 2s, 2p, etc. The number of electrons ina 
band will be, of course, n 
times the number of electrons 

1S 


in the corresponding shell of 

zr an atom. Thus, in the Is and 
2s bands there will be 2n 
electrons, in the 2p band 6n 
electrons, etc. 

The width of a band depends on the interaction forces between 
atoms. This concept is illustrated schematically in Fig. 297. The 
energy levels of a sodium atom are shown at the left and the expan- 
sion of the levels into bands in the formation of a crystal lattice are 


shown at the right. The quantity 4 is plotted along the abscissa. 


Fig. 297 


Perceptible expansion of the 1s level does not occur since the required 
interatomic spacings are absolutely unrealisable. The 2s and 2p 
bands are also practically unexpanded under normal conditions 
(indicated by a dotted line). On the other hand, the 3s and 3p bands 
are expanded to such an extent that they overlap. This means that 
the interaction which occurs between sodium atoms under normal 
conditions affects only outer electrons. (Sodium has no electrons in 
the 3p state. Nevertheless, we shall also be concerned with unoccupied 
energy levels when the exciting energy is sufficient to transfer an elec- 
tron to such a level.) 

What is the significance of the overlapping of the 3s and 3p bands? 
Actually, our scheme of correspondence between the energy levels of 
an atom and a solid fails in this case. However, we shall not let this 
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disturb us. The overlapping of the bands signifies that the wave func- 
tion properties of an electron in the overlapping region differs from 
the wave function properties of an atomic electron. Thus, the 
outer electron of a free sodium atom is an s electron. In liquid and 
solid sodium the 3s and 3p bands overlap; the behaviour of the outer 
electrons of sodium differs from the behaviour of an s electron, i.e., 
certain special (hybrid) properties appear (such electrons reflect the 
peculiarities of s and p wave func- 
tions). n(€) 

The described behaviour may 
be established experimentally by 
means of spectral methods. The pres- 
ence of an energy band rather than : ras 
a distinct energy level can be es- 
tablished by examining the transi- 
tion ofelectronsfrom ahighbandto #€) 
a low one. That which would have 
produced a sharp line in the case 
of a free atom now produces a 
broad spectral band. aes 

It is more convenient to examine 
transitions from an energy band  jn¢e)f 
to a single distinct level—for 
example, in the case of sodium, 
transitions to the 2p level. The 
spectral band obtained in this man- é— 
ner enables us to determine the ; 
width of the energy band as well as Fig. 298 
the electron distribution according 
to energy. In the case of sodium, this requires that electrons be 
dislodged from the 2p shell. The frequencies of the resulting transi- 
tions lie in the region of soft X-rays (several hundred Angstroms) 
and are very difficult to detect. Special X-ray tubes, in which the 
anode serves as the material under investigation, are used in such 
studies. 

From measured values of the intensity of the obtained spectral 
band, we may plot a curve of intensity as a function of the frequency 


v. But vate (where @ is the transition energy, i.e., the energy rela- 


tive to the distinct level), and the intensity at a given v is proportiorfal 
to the number of electrons having an energy &. Curves of n (6) as a 
function of &, where n (6) is the fraction of electrons in the band hav- 
ing an energy between 6 and € + dé, may be plotted from experi- 
mental data. Three typical curves are shown in Fig. 298. The first 
curve corresponds to an energy band in which the maximum energy is 
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sharply defined. This signifies that all lower energy levels are occu- 
pied. The abrupt drop in the curve indicates that the lower levels 
are filled to capacity (two electrons per. level). The second curve is 
typical of elevated temperatures. In this case, the edge of the band is 
smeared and the order of magnitude of the width of the smeared 
region is equal to kT. This means that some of the electrons are in 
an excited state and can occupy higher levels. The third curve, which 
shows two nonoverlapping bands, is very interesting. The lower band 
is filled and the upper one is partially occupied. A forbidden band is 
located between the two allowed bands. : 


269. Electron Gas 


It is evident from the preceding article that as far as solid-state 
theory is concerned only the upper energy bands are of interest, since 
electrons at lower levels practically do not take part in interactions 
between atoms. How can the behaviour of upper band electrons be 
described? Since we are dealing with a very large number of electrons, 
it is natural to use statistical physics methods and consider an aggre- 
gate of such electrons as a kind of gas. 

The state of each electron of such a gas may be represented by a 
point (p,, p,,p,) in momentum space. The direction of motion of an 
electron is parallel to its radius vector p and the energy of an electron 
depends on its momentum. In a crystal, the energy of an electron will 
depend on its direction of motion. Let us disregard this for a moment 
and assume that the electrons behave like free particles. Despite the 
fact that this is a rough approximation, i.e., that we neglect the 
potential energy of the field in which the electrons move and the in- 
teraction between electrons, the results give us a good description, 
at least qualitatively, of the behaviour of the electrons of a solid 
which form an energy band. 

If the electrons are free, the relationship between their energy .and 


momentum is given by the formula €= oP. This means that in mo- 


mentum space a surface of equal energy is a sphere. It is customary to 
call such a sphere a Fermi sphere, after the famous Italian physicist. 
As indicated in the preceding article €,,,,, the maximum energy of the 
electrons in a band, may be determined experimentally. We can say, 
therefore, that the states of an electron gas are contained in a sphere of 
radius Pax =V 2ME max: Thus, it would not be incorrect to call this 
Fermi surface a surface of maximum energy. 

To qualitatively check the validity of this theory, let us estimate 
from the value of @,,,,, the number of electrons in a band. We may rea- 
son as follows. According to the principle of uncertainty, the projection 
of the momentum of a particle in a metal body of linear dimension L 
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cannot be determined with greater accuracy than - Therefore, 


Es 
in momentum space, the concept of a point should be replaced by the 
concept of a cell of volume hn’ where V is the volume of the metal 


V ? 
body under consideration. One of the basic postulates of the theory 
is that such a cell represents a quantum state and that it can contain 
no more than two electrons of opposite spin. If there are N electrons 


in a volume V in the band under consideration, then = cells are occu- 
N AS 


77 . This is the volume of a Fermi sphere of 


pied, i.e., the volume 
radius Pyg,. Thus, 


4 Sara YP NP 
Aa 1 (VV 2m€ max) = oy 
_ Perfectly reasonable values of N may be obtained using this equa- 
tion. This means that the above assumptions are more or less valid. 


Example. In a metal, the maximumenergy, determined experimentally, is 


Emax~ 10 ev=1610" erg. Using this value, we obtain Pmgy=V 2ME max © 
~2xX107% gm cm/sec, i.e., the maximum electron velocity in a metal is 


Pmac _ 2x107° 


°max =n 9x10 


~ 2x 108 cm/sec. 
Hence, the number of electrons in a unit volume is 


N=2x— mu (V 28 max) a 10%. 


The above discussion assumed that the temperature is at absolute 
zero. At a higher temperature, electrons may pass over into momen- 
tum-space cells which correspond to higher energy. Such a transition 
will take place for electrons located in cells close to a Fermi surface 
(otherwise too high a transition energy is required, which is unlikely 
to obtain) and the boundary of the sphere will be broad (not distinct). 
Only at very high temperatures will the excitation affect low-energy 
electrons. As the temperature is increased, the degree of degeneracy-of 
the electron gas decreases. An electron gas has a high degree of degener- 
acy, particularly at low temperatures. The term “degeneracy” signifigs 
that different quantum states have one and the same energy. 

The distribution of electrons according to energy ata given tempera- 
ture may be calculated. This distribution differs from a Boltzmann 
distribution. According to the Boltzmann law, at absolute zero, the 
energy of electrons should be equal to zero. From the viewpoint of the 
new theory, electrons should have a high energy at absolute zero 
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(this follows from the Pauli exclusion principle).* On the basis of the 
Pauli exclusion principle, we can construct a new statistics (Fermi- 


é 
Dirac statistics), in which the function e *? is replaced by the ex- 
pression 
1 
E— Emax 
e kT neha 

where 6,2, is the maximum possible energy of the electrons at abso- 
lute zero. Multiplying this factor by the electron distribution at abso- 
lute zero yields the electron distribution at any temperature. 





Fig. 299 


Fig. 299 shows the dependence of the Fermi-Dirac function on & 
when kT=0.1 and 2.5 ev. 

It should be noted that different particles obey different statistics. 
Molecules obey Boltzmann statistics, photons Bose-Einstein statis- 


tics, and electrons (and other particles having a spin of z) Fermi- 


Dirac statistics. 
The difference in statistical approaches consists in the different 
methods of distribution of particles according to their possible'states. 
Let us assume that there are two states in which two particles may 
be possibly located. In Boltzmann statistics, where particles possess 
individuality, the following possibilities must be considered: 1) both 
particles in the first state; 2) both particles in the second state; 3) the 


* ©max has an order of magnitude of several electron-volts, while the average 
energy of thermal motion (kT) is equal to several hundredths of an electron- 
volt. Thus, electrons move rapidly even at absolute zero. The velocity of elec- 
trons at absolute zero is 1,000 times as great as the velocity of atoms at room 
iemperature. This should be re-emphasised in order to make it clear that the 
relationship existing between kinetic energy and temperature in the case of 
molecules is not applicable in the case of electrons. It follows, moreover, that 
an electron gas has negligible thermal capacity. The thermal capacity of a 
body is not affected by the presence of an electron gas. 
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first particle in the first state and the second in the second state; 
4) the first particle in the second state and the second in the first 
state. Thus, there are four possibilities in all. 

In Bose-Einstein statistics, particles are indistinguishable from one 
another. There are therefore three possibilities: 1) both particles in 
the first state; 2) both particles in the second state; 3) one particle 
in the first state and one in the other. 

In Fermi-Dirac statistics, the Pauli exclusion principle is taken 
into account: only one particle may be in a given state. The number 
of possible distributions is reduced to one, i.e., one particle ineach 
of the two states. 

Thus, outer electrons of the atoms of a solid behave like an elec- 
tron gas. This is a very peculiar kind of gas, i.e., its particles obey 
Fermi-Dirac statistics. 


270. Conductivity 


In the absence of an electric field, the state of an electron gas is 
such that the number of electrons moving from right to left is equal 
to the number moving from left to right. When a field is applied, 
forces are produced which make the electrons move parallel to the 
field. The distribution of electrons in momentum space becomes 
nonsymmetrical with respect to the origin, i.e., a displacement 
occurs in the direction of the field. An ordered motion which pro- 
duces an electric current is superimposed on the extremely rapid 
random motion of the electrons. 

For the distribution of electrons to be displaced, electrons must, 
to be sure, pass to higher energy states. Such a transition is always 
possible if there are vacancies in the energy band. If the energy 
band is fully occupied, i.e., if all its levels are occupied by elec- 
trons as allowed by the Pauli exclusion principle, the electrons have 
no place to go—in any case, not until they acquire sufficient energy 
to make a transition to the next band. 

Were it not for the overlapping of bands (discussed above), one 
could assume that all elements having one valence electron are 
conductors and all elements giving up two electrons to be shared 
in the process of forming a solid are insulators. Thus, sodium has 
one electron at the 3s level. In the formation of a body consisting 
of N sodium atoms this level splits into NW levels. At each level thgre 
may be two electrons of opposite spin, i.e., a total of 2N electrons. 
But since we have only N valence electrons, half of the energy band 
is unoccupied. Magnesium, the next element in the Mendeleyev 
periodic table, has two electrons (per atom) at the 3s level. There- 
fore, in the formation of a magnesium crystal, all levels would be 
occupied were it not for the overlapping of energy bands. 
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Investigation of the form of energy bands of various elements 
shows that the above explanation of the origin of conducting prop- 
erties is completely valid. Only when the upper band or the 
merged bands are not fully occupied may the body be classified as a 
conductor. 

The distribution of the electrons of a conducting body in momen- 
tum space may be displaced in the direction of a field. Since the 
number of electrons moving in the direction of a field does not equal 
the number of electrons moving in the opposite direction, an elec- 
tric current is produced. In an insulator, all the energy bands are 
completely occupied. Ordinary field intensities are incapable of 
creating the forces necessary to transfer electrons to the next higher 
band, and if we persist in trying to transfer electrons of an insula- 
tor to the next band the dielectric breaks down. The electron dis- 
tribution maintains its symmetry in momentum space and the num- 
ber of electrons moving to the left remains equal to the number of 
electrons moving to the right, i.e., no current flows. 

To return to the discussion of conductors, let us now roughly es- 
timate the magnitude of the electrical conductivity of a body which 
has n free electrons in a unit volume. By free electrons or conduction 
electrons, we mean electrons located in unfilled energy bands. 

Assume that the motion of an electron under the action of an ac- 
celerating force eE occurs during a small time interval tHe. 
Here, v is the velocity of an electron and / the length of its mean 
free path. The path is traversed at the extremely high random veloc- 
ity of the electron. The velocity of the ordered motion of the elec- 
trons creating the electric current is many orders of magnitude less 
than the random velocity and, therefore, is not included in the de- 


: , . ; : eE 
nominator of the expression for t. Motion with an acceleration a 
f 


during a time interval increases the electron velocity to ee Thus, 
the approximate value of the velocity of the ordered motion of the 
electrons creating the current is u =I : 


The density of the electric current is simply the quantity of elec- 
tricity passing through a unit area per unit time, i.e., fj = neu. 
Substituting the above value of u, we obtain 

j=l 
mu 

Since Ohm’s law in differential form is given by j =oE, a relation 
may be obtained for electrical conductivity: 

nel 
™~ "me * 
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This gives us only a rough estimation of the electrical conductiv- 
ity. In view of the assumptions made for purposes of simplification, 
calculated and experimental results will differ by as much as an or- 
der of magnitude. However, we are interested merely in a qualita- 
tive picture. It can be seen that the conductivity is proportional to 
the number of free electrons. This number and the length of the free 
path may vary from substance to substance. 

Example. If the length of the free path of an electron in a metal is 7~10A= 
=107’cmand v is of the order of magnitude of 10° cm/sec (see the example on 
p. 677), the free path is traversed in a time t=107 * sec. 

Assume that the voltage drop along a I-cm segment of a metal conductor 
with a l-cm? cross-section is equal to 0.003 v=10~* Gaussian unit. Then, E=10~* 
Gaussian unit and the velocity of the ordered motion of an electron is 
un ae ~5x107% cm/sec. The current density is j=neu~ 104.8107 '°X 
x5 x 107%~ 30 10? Gaussian unit= 100a/cm?. This yields quite reasonable values 
of conductivity: 


o ~ 25x10" Gaussian unit + 28x 10* ohm7? cm7", 


If a crystal had an ideal lattice and the temperature approached 
absolute zero, there would be no restriction on the length of the free 
path and the material would have no electrical resistance. The elec- 
tron range is limited by atomic thermal vibrations and the presence 
of various crystal imperfections. Both factors disturb the ideal pe- 
riodicity of the field in which an electron moves and result in the 
scattering of electrons. It follows that the conductivity of a body 
improves as its temperature decreases and approaches a limit which 
is determined by the degree of perfection of the crystal lattice. 

It can be shown experimentally that the resistance of a metal de- 
creases with temperature. This would indicate that the theory is val- 
id for metals. Moreover, the fact that electrical resistance decreases 
with temperature is an essential characteristic of metals. The plas- 
tic deformation of a metal, the impairment of its lattice by nuclear 
bombardment, and in general any action serving to damage the lat- 
tice will reduce the length of the free path and therefore result in 
increasing the electrical resistance. 

In Part I (p. 219), the thermal conductivity of gases was discussed. 
It was shown that the thermal conductivity of a gas is propor- 
tional to the length of the free path and is given by the formula 
x~ovle,. Is this formula useful for the calculation of the thermal 
conductivity of metals? Electrons are much lighter than atoms and 
one is justified in assuming that heat is transmitted by electrons 
which transfer energy from one atom to another. Since the length of 
the free path is not known, one cannot calculate the coefficient of 
thermal conductivity. However, it should be noted that the ratio 
of the coefficient of electrical conductivity to the coefficient of ther- 
mal conductivity does not contain unknown parameters and depends 
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only on universal constants and temperature: 


u 
—=const T 
o 


(Wiedemann-Franz formula). Experimental results agree fairly 
closely with the value obtained by means of this formula. The follow- 


ing table gives the values of the quantity ar at 0° C for a number 
of metals. 
Cu 


Metal | Ag | Au Mo 











Po | Pt 











2.31 | 2.35 | 2.23 | 2.61 | 2.47 | 2.51 | 2.52 | 2.31 





























The theoretical value of this quantity is equal to 2.45 x 10-%, 


271. Superconductivity 


Since a crystal always has a considerable number of imperfections, 
it will usually possess a residual resistance, which is reached at a 
temperature of several degrees Kelvin, i.e., below this point the resist- 
ance does not decrease with temperature. However, there exist about 
ten metals which behave quite differently. At definite temperatures 
close to absolute zero, such metals completely lose their electrical 
resistance. When an electric current is induced in such a supercon- 
ductor, the current will flow in the circuit for days. This shows that 
the resistance has not simply decreased, but has dropped abruptly 
to zero. 

Of the pure metals, niobium has the highest temperature (9°K) 
and hafnium the lowest (0.3°K) at which superconductory proper- 
- ties appear. 

It might seem that superconductivity is a property common to 
all metals, i.e., if the temperature is reduced sufficiently super- 
conducting properties will appear. This is apparently not so. The 
temperature of numerous materials has been reduced down to 0.03°K 
without superconducting properties being manifested. The suppo- 
sition that such properties are not universal is supported by the fact 
that superconducting metals occupy a definite part (the middle) 
of the Mendeleyev periodic table. 

Superconducting materials include, in addition to pure metals, 
numerous alloys of such nonsuperconducting metals. Moreover, a 
chemical compound may be a superconductor even though neither 
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of its components is one. Thus, copper sulphide is a superconductor, 
but copper and sulphur are not. Niobium nitride already reveals 
superconducting properties at 30° above absolute zero. 

The disappearance of electrical resistance at a temperature T, is 
not the only peculiarity of superconductors. 

Another mark of a superconductor is its characteristic behaviour 
in a magnetic field: generally speaking, a magnetic field penetrates 
such a conductor to a depth of only about 1,000A. If very thin films, 
the behaviour of which is somewhat peculiar, are left out of consid- 
eration, we may make the following generalisation: the magnetic 
field inside a superconductor equals zero. 

However, this is true only as long as the applied external field 
does not exceed a certain critical value H,. When this value is ex- 
ceeded, the superconducting state disappears—the magnetic field 
penetrates the material and electrical resistance is restored. 

H, is afunction of temperature, i.e., it is not a constant. At the 
temperature T,, a very weak external field suffices to destroy the 
superconducting state. Generally speaking at T — T, the critical 
intensity H, equals zero. H, gradually increases as the temperature 
is decreased, and at absolute zero it reaches its highest value. For 
example, the maximum value of the critical field intensity of mer- 
cury (T,= 4.2°K) is 412 oersted. 

Electrical resistance is due to the scattering of electrons by the 
thermal waves of atoms in a crystal lattice. These thermal waves 
exist, as we know, because of the presence of a zero energy, even at 
absolute zero. It would seem, therefore, that electrical resistance 
should not disappear no matter how much the temperature is de- 
creased, 

How is it possible then to have thermal scattering of electrons 
and no resistance to electric current? This problem was not solved 
until 1957 when it was proved by means of quantum mechanics that 
electrons in a thin energy layer next to a Fermi surface are able to 
become “paired” thanks to interaction with the thermal vibrations 
of a crystal lattice. It transpired that at low temperatures it is ad- 
vantageous from the energy viewpoint for two electrons of equal 
spin magnitude but opposite spin direction to become “united”. 
The words “paired” and “united” have been placed in quotation marks 
because calculations indicate that the wave functions of these elec- 
trons extend over a large distance, viz., of the order of 10-* cm (the 
size of a crystal grain in an ordinary polycrystalline metal). There- 
fore, the formed pairs should not be viewed as peculiar “molecules”, 
since the bond is implemented over a large distance by means of ther- 
mal waves. 

It follows from the theory that all electron pairs are identical in 
the sense that they have the same total momentum. 
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“Matter” consisting of such electron pairs possesses superconduction 
properties. The formation of electron pairs does not eliminate the 
thermal scattering of electrons. Superconduction occurs because the 
scattering of the electrons of a pair ceases to affect the current strength. 
Thermal scattering can only break up one or another pair or form a 
new pair from separate electrons, but the magnitude of the current 
is determined by the total momentum of the electrons, which remains 
unchanged. Thus, according to this model, the thermal scattering 
of electrons may produce electric current fluctuations, but it cannot 
stop the current. : 

A superconductor contains, in addition to “paired” electrons, an 
ordinary electron gas, i.e., a gas of individual electrons. Thus, in a 
superconductor, there exist, so to speak, two fluids—an ordinary 
fluid and a superconducting one. If the temperature of a superconduc- 
tor begins to rise from absolute zero, thermal motion will break 
up more and more pairs of electrons, i.e., the ratio of the ordinary 
electron gas to the superconducting electron gas will increase. Final- 
ly, the critical temperature is reached and the last electron pairs 
break up. 

The new theory provides a quantitative explanation of all of the 
superconduction phenomena discussed above. 


272. Semiconductors 


Properties. A large group of substances (various elements and 
chemical compounds), the conductivity of which lies in the broad 
interval between those of conductors and insulators, are classified 
under the heading of semiconductors. Since a voltage of | volt will 
produce currents of several hundred thousand amperes in a cube of 
metal having a volume of 1 cm*, and currents of the order of 10~*°a 
in insulators under the same conditions, one can see that the interval 
occupied by semiconductors is extremely large. 

The conductivity of substances in this interval has a number of 
peculiarities which enable us to “recognise” semiconductors. 

First, it should be noted that the dependence of conductivity on 
temperature is opposite to that of metals. The conductivity of semi- 
conductors, in contradistinction to that of metals, may decrease 
rapidly with temperature. At low temperatures, a semiconductor 
may become an insulatcr. The resistance of most semiconductors 
is considerably more sensitive to changes in temperature than met- 
als. Compact temperature meters of high sensitivity may be con- 
structed using semiconducting thermal resistors (thermistors). 

A second important feature of semiccnductors is that in a number 
of cases they may possess positive (p) as well as negative (n) conduc- 
tivity. The terms positive and negative are used in the following 
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sense: if the current 1s due to the motion of positive charges thecon- 
ductivity is called positive and if it is due to themotion of negative 
charges it is called negative. Thus, metals have negative conduc- 
tivity since the current is due to the motion of electrons. Both kinds 
of conductivity occur in semiconductors. This effect seemed strange 
at first since the flow of 
current in a semiconduc- 
tor is not associated (as 
in an electrolyte) with 
the displacement of ions, 
and the question of the 
nature of positive car- 
riers of current remained 
open for a period of time. 

The sign of the cur- 
rent carrier may be de- 
termined in a number 
of ways. Let us examine 
the most convincing evi- 
dence, which is based on 
a study of the forces 
exerted by a magnetic field on current-carrying particles (Hall 
effect). Ii an electric current flows along a plate which is placed 
perpendicular to magnetic lines of force then, on a charged par- 
ticle e moving with a velocity u, a force F will be exerted in a 
direction perpendicular to the field and current (see Fig. 300). In 
other words, an electric field of intensity E—uB will be created in 
such a direction (see p. 294). 

A potential difference U=uBd is produced between the plate faces 
perpendicular to the created field. The sign of this potential differ- 
ence is determined by the sign of the charge carrier. 





Fig. 300 


Example. Consider a 1X2 0.5-cm® semiconducting plate in a magnetic field 
B=1,000 gausses. Assume that the conductivity o of the plate is equal to 
30hm~! cm™! (zinc oxide). If a potential difference of 1 volt is applied between 
the plate faces which are separated by a distance of 2 cm, an electric current of 


density j= oF = 3X + =15 a/cm? will flow through the plate. 


Experiments show that a potential difference U=0.12 mv is produced between 
the lateral surfaces of the plate. The sign of the Hall effect (see Fig. 300) indi- 
cates that the charge carriers are electrons. The velocity of their ordered motion is 


_ U__0.12x107*y 
Ba 0.1 ZSe 
me 


u =0.12 m/sec=12 cm/sec. 


X107 2m 





Note that this velocity is more than 1,000 times as great as the velocity of the 
ordered motion of conduction electrons in a metal (see the example onp. 681). 
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The number of conduction electrons per unit volume of semiconductor is 


. poy 
nah ——_1 5 alem’ __g x10" em’, 
eu 1.6x10 Cx 12 cm/sec 


ie lows value of o is due to the fact that this n is about a millionth of that of a 
metal. 

A final important feature of semiconductors is their extreme sen- 
sitivity to impurities, which not only affect their conductivity (an 
impurity of the order of one per cent may change the conductivity of 
a semiconductor by a factor of a million or more), but may change 
n-conductivity into p-conductivity and vice versa. 

The most important semiconductors already of practical signif- 
icance include germanium, silicon, selenium, antimony alloys (con- 
taining indium, cadmium and zinc), and copper and titanium oxides. 

Interpretation of Properties. Most characteristics of semiconduc- 
tors can be easily explained by means of an energy level diagram. 
Insulators have a filled energy band. The next unfilled band is sep- 
arated from the filled band by a large energy gap. Imagine that the 
system of levels of a substance is such that the gap between these 
bands decreases and the energy of thermal motion suffices to transfer 
electrons from the filled band to the unfilled band. Such a substance 
will act as a natural semiconductor. 

At a given temperature, the number of electrons in the upper band 
will be determined by the dynamic equilibrium conditions estab- 
lished between bands. Electrons continuously pass from the lower 
band to an excited state and vice versa. Just as in the case of a sat- 
urated vapour, equilibrium will prevail when the number of elec- 
trons moving “upwards” equals the number of electrons moving 
“downwards”. 

Again as in the case of a saturated vapour, when the temperature 
is raised the equilibrium is displaced in the direction of the upper 
level, i.e., the instantaneous concentration of electrons in the upper 
band increases. The concentration of free electrons rises sharply 
with the increzse in gap between bands. The probability of surmount- 
ing an energy barrier of width A@ invariably appears as an exponen- 
tial factor. The approximate concentration of electrons in the upper 
band at a temperature T may be determined from the formula 

Ag 
n=10%xe 24%, 

If a body has a gap A@ which is significantly greater than RT, it 
belongs under the hedding of insulators. For this purpose, it is 
sufficient for G to be 100-200 times as great as kT. At room tem- 


perature, kT = ev, When A@ becomes less than 1 ev, i.e., about 
40 times as great as kT, the number of electrons in the upper band 
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will be sufficient to create measurable currents. When A@ is of 
the order of several tenths of an electron-volt, a semiconductor poss- 
esses considerable conductivity. 

nel 


Reference to the conductivity formula o = —— shows that when 


the temperature of a semiconductor changes two factors on which 
o depends also change. In the first place, as the temperature in- 
creases, the number of free electrons n increases, but, as previously, the 
free range I decreases with increasing temperature. However, experi- 
ments show that usually the . 

former effect overshadows the =— Field 
latter. —> Movement of electrons 

Until now, we have discussed 
the conduction properties of the 
upper band, but have disregard- 
ed the lower band, which should 
also acquire conduction proper- 
ties since vacancies are formed 
in it as the result of the transi- 
tion of electrons to the upper 
band. This conductivity may 
be very peculiar in nature. 

The creation of conductivity in the upper, partially filled band 
may be interpreted as a displacement in the distribution of electrons 
in momentum space in the direction of the field (to the right in 
Fig. 301a). However, this is not the only way in which ordered motion 
of electrons may occur. Imagine that the overall shape of the distri- 
bution of electrons does not change (see Fig. 3010). However, now 
one electron, and now another, close to the Fermi surface is snatched 
away and a “hole” is formed in momentum space. Under the action 
of the field, such a hole is immediately filled by a neighbouring 
electron moving from left to right (in the same direction as in the 
other diagram). The hole is displaced from right to left. Now, an- 
other point representing an electron in momentum space occupies 
this position and in this manner the hole moves in the opposite di- 
rection to that of the electrons. Since holes are formed continually, 
a continuous positive “hole” current flows. 

Thus, in a natural semiconductor, electric current may be viewed 
as the result of the motion of “holes” in the occupied band as well 
as of electrons in the upper band. However, the major role in such 
cases is played by the motion of electrons in the conduction 
band. 

This natural conduction of semiconductors is encountered consid- 
erably less frequently than another phenomenon, namely, semi- 
conduction properties under the action of a small percentage of 





6) 
Fig. 301 


688 Effect of Electron Structure on Properties of Bodies (Chap. XXXVII 





impurities. The role of foreign atoms or other lattice imperfections 
consists in their contribution to the system of energy levels. Fre- 
quently, imperfections create their own level—a narrow energy band 
between the filled and unfilled bands. 

Let us assume that the foreign atoms contribute “surplus” elec- 
trons which occupy a narrow band between the main bands. When 
the temperature increases, electrons pass from the level of the im- 
purities to the conduction 
band in greater and great- 
er numbers, i.e., conduc- 
tion increases. Such a semi- 
conductor yields n-con- 
duction. A point may be 
reached (for a low per- 
centage of impurities) at 
which all surplus electrons 
are given up. A further 
increase in temperature will 
not result in an increase 
in conduction and from then 
on the body will act like a 
metal. Such behaviour may 
be detected when pentaval- 
ent arsenic or phosphorus 
atoms are introduced into 

Fig. 302 a lattice of quadrivalent sil- 

icon or germanium. Fig. 302 

shows a simplified diagram of the crystal lattice of silicon. If a 

silicon atom is replaced by an arsenic atom, a “surplus” electron is 
obtained. This will be a conduction electron. 

It is interesting that impurilies may result in p-conduction. This 
occurs when an impurity atom has acceptor properties, i.e., can 
attract electrons. Electrons pass from the filled band to the inter- 
mediate level of the impurities and as a result hole conduction occurs 
in the filled band. Such conduction occurs in silicon containing 
a trivalent aluminium impurity. If at a number of lattice sites 
silicon is replaced by aluminium, “electron-deficient sites” will 
exist in the crystal. When a field is applied, an aluminium atom may 
attract an electron from a neighbouring silicon atom; the electron 
falls under the action of the electric field and a “hole” moves in the 
opposite direction. 

It should be noted that such “unsophisticated” models of the dis- 
placement of an electron are greatly oversimplified in view of the 
fact that the motion of electrons in a solid satisfies the laws of quan- 
tum mechanics. 
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By adding one or another impurity, we are able to vary the conduc- 
tivity of materials within very broad limits. We may change p-type 
conductivity into n-type conductivity and may significantly change 
the nature of the temperature dependence of the conductivity. 


273. Emission of Electrons 


Work Function of an Electron. Electrons in a conduction band 
behave like an electron gas. The surface of a solid acts as the “walls” 
of the vessel in which this gas is located. To leave the bounds of this 
surface, an electron must sur- : 
mount a potential barrier the 
height of which is designated 
by &. At absolute zero, elec- 
trons have a limiting energy 
W. In the model of an electron 
gas, W corresponds to a Fermi 
surface. This is the energy of 
electrons which at absolute 
zero are located at the highest 
level. Thus, in order for an 
electron to surmount the potential barrier, it is not necessary to 
impart to it an energy @; it is sufficient to give it an additional 


energy esp 


The quantity A is called the work function and 4 =the potential 


function, i.e., the work function expressed in volts (see Fig. 303). 

An electron’s escape from a metal is impeded by the forces of attrac- 
tion exerted by positive ions as well as the force of attraction be- 
tween the electron and its electrical image. The latter force is equal to 


oes where x is the distance of the electron from the surface. This 


force is capable of holding an electron at a considerable distance 
from the surface and thus forming a layer or electron cloud close to 
the surface of the body. 

If the metal is located in an electric field, the work function de- 
creases by an amount eV cE, where e is the electron charge and E is 
the field intensity. The intensity of the external field must equal 
0.2x10’v/em if the potential function is to decrease by Iv. (Thus, 
in ordinary electronic instruments the external field has little effect 
on the work function.) 

The work function is very sensitive to changes in surface properties. 
It transpired that it is possible to deposit electrically pcsitive atoms 
or ions (metals, oxygen) on the surface of a cathode, In this manner, 





Fig. 303 


23—3580 


= 
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a layer of positive charge may form on such a surface. Thus, as may 
be seen from the following example, the work function can be con- 
siderably reduced. 

The filament of the heater in electron tubes is usually made of 
tungsten (work function A =4.9 ev). By coating the surface of the 
tungsten with a layer of an oxide of an alkaline earth metal (Ca, 
Ba, Sr), we may decrease the work function to 1.5-2 ev. This enables 
us to obtain considerably greater emission at lower filament tem- 
peratures. — 

Thermionic Emission. The escape of electrons from a metal upon 
heating is known as thermionic emission. This phenomenon is basic 
to the action of heater-cathode tubes. When the temperature is raised, 
electrons are excited, some of them acquiring a sufficient velocity 
in the direction perpendicular to the surface of the material to sur- 
mount the potential barrier @. 

An electron gas obeys Fermi-Dirac statistics, according to which 
the number of electrons having an energy @ is proportional to the 


expression “pope . But we are interested in the energies @ which 
kT 
are considerably greater than the zero-level energy W. Therefore, it 


is accurate enough to reduce the 
above expression to 





_w-€ —e9 
e BF gage. 





Thus, we may determine the num- 
ber of electrons having an energy 
equal to the height of the potential 
well. It may be rigorously proved 
that the thermionic emission cur- 
rent is proportional to this expres- 
sion. We see from the formula that 
thermionic current increases  ex- 
tremely rapidly with temperature. 
Fig. 304 The circuit shown in Fig. 304 may 

be used to measure the thermionic 

current. By increasing the voltage, we quickly reach saturation cur- 
rent. (This is the thermionic current referred to above.) The initial 
portion of the current-voltage curve is space-charge limited (see 
above). The voltage draws electrons to the plate from the electron 
cloud and the cathode emits enough electrons to maintain the cloud 
in equilibrium. In the absence of an external voltage, this equilibrium 
is determined by the electron emission of the cathode and the nega- 
tive potential of the cloud. As the voltage is increased, the electron 
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cloud begins to dissipate and emission increases until the voltage 
draws all electrons of the cloud to the plate. At this point current 
saturation sets in. 


Rigorous analysis yields the following relationship between electronic cur- 
rent and temperature: 
ee 


j=AT% *F (Richardson formula). 


For tungsten, A=75a/cm? deg? and ew=4.5 ev. Let us compare the current 
density of thermionic emission from tungsten at 500°K and 2,000°K. 
At 500°K, 


4.5xX1.6x10 7? 


j=T5X25X108xe 1+38X107"x500 . 19-96 aiem?, 


In other words, to obtain measurable currents, cathodes of impracticable 
dimensions (greater than that of the entire land mass of the globe) would be 
required. 

At 2,000°K, 

5.21x104 


2,000 


j=75x4x108xe ~ 1.6 ma/em?. 


Such a current is easily measured, but the area of the emitting surface is still 
too large for most practical purposes. 

The picture changes when tungsten is coated with caesium.Now, A=3.2a/cm*deg? 
but ep=1.36 ev. At T=2,000°K, 


_ 1.57108 
j=3.2x4x108%xe 2-900 ~~ 4 8x 108 afem’. 


Clearly, such current densities would destroy the cathode. Therefore, the re- 
quired Sad of current density, j<1] a/em’, are attained at lower temperatures 
(~1,300°K). 


Secondary Emission. This refers to emission due to the dislodge- 
ment of electrons from a metal under the action of other electrons. 
Secondary electrons may emerge, taking the same direction as the 
primary electrons. This shows that primary electrons interact with 
bound electrons; otherwise, the law of conservation of momentum 
would be violated. Secondary emission begins at a primary electron 
energy of the order of 10 ev. Most secondary electrons have an energy 
of several electron-volts—their energy distribution is practically 
independent of the energy of the primary electrons. 

Primary electrons produce secondary electrons and, in addition, 
are elastically scattered. The remarkable phenomenon of one pri- 
mary electron producing several secondary electrons has wide appli- 
cation (e.g., Kubetsky tube). 


23* 
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274. Photoelectric Effect 


Extrinsic Photoeffect. This phenomenon may be studied by mak- 
ing the cathode of a vacuum tube out of the material under inves- 
tigation. Light falling on the cathode dislodges electrons from the 
material. Electrons reaching the anode generate a photoelectric 
current, the magnitude of which may be studied as a function of 
external conditions. 

It is the total current taken from the cathode, of course, that is 
characteristic of the substance. Here too, therefore, we usually oper- 
ate under conditions of saturation current. If no voltage is applied 
across the photccell, a weak current generated by electrons leaving 
the cathode in the direction of the anode flows through the instru- 
ment. A weak accelerating voltage is not sufficient to attract all the 
electrons, but at a certain voltage all the electrons reach the anode 
and saturation current is obtained. 

Experiments show that the photocurrent is strictly proportional 
to the intensity of the incident light. This is true of light of any fre- 
quency which produces a photoeffect. 

Moreover, the number of dislodged electrons is exactly equal to 
the number of photons. One photon may dislodge only one electron. 
It is not possible for a photon to dislodge several electrons from 
a substance by a series of energy losses. This important postulate is 
somewhat difficult to prove in a study of the extrinsic photoeffect, 
since the extrinsic photoefiect may be accompanied by an intrinsic 
photoeffect (see below) in which some of the electrons do not leave the 
bounds of the substance. 

The law of conservation of energy and the law of conservation of 
momentum are obeyed in the interaction of a photon and an elec- 
tron. The law of conservation of energy (the Einstein equation) takes 
the form 


hy = eg, 


where q is the potential function of the electron for the metal (the 
same as in the thermionic emission experiments). In accordance with 
the law of conservation of momentum, it may be assumed that the 
lattice of the metal takes part in the photoelectron interaction proc- 
ess (otherwise electrons could move only in the same direction as 
the photons). 

A photon may produce a photoeffect if its energy is not less than 
the work function. It follows that each material has a photoeffect 


limit. The limiting frequency v, is equal to +9 and the limiting 
wavelength A, (in millimicrons)—the “red” limit of the photoeffect— 
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is where @ is expressed in volts. No photoeffect will 
take place when a substance is irradiated with light of long wave- 
length. The “red” limit of the photoeffect in the case of Li is A, 560 
millimicrons (5,600 A), i.e., it is in the yellow region of the visible 
spectrum; in the case of Cu, 4,300 millimicrons (3,000 A), i.e., 
the ultraviolet region; and in the case of Al, 4,=410 millimicrons 
(4,100 A), i.e., the violet region of 
the visible spectrum. 

If the energy of a photon is 
greater than the work function, the 
surplus goes into the kinetic energy 
of the electron. Thus, hard radia- 
tion can produce very fast photo- 
electrons. 

In order to measure exactly the 
limiting frequency and the work 
function, we use a retarding poten- Fig. 305 
tial method. A small bias voltage 
is applied to the photocell (plus terminal is connected to the pho- 
tocathode) and this voltage is increased until the current cuts off. 


This point is reached when vV,=> . In this manner, we may deter- 
mine experimentally the dependence of V, on the frequency of light: 





V,= goo Pp: 


The plotted curve is a straight line the slope of which is equal to 
the universal constant a The limiting frequency v, and the poten- 


tial function @ are obtained as intercepts along the abscissa and ordi- 
nate, respectively (see Fig. 305). 


Example. If softX-rays of wavelength! A=100A fall on a copper plate 
(g=4.1v), the bias voltage cuts off the photocurrent when 


27 3X10 1 


eVy=hv—ep=6.6x 10 Tae 34.1 ev=120 ev. 


1.6x10— 


Therefore, the bias voltage will equal 120v. 


Another important characteristic of a photocathode material is 
the spectral dependence of the photocathode. Here, no simple rela- 
tionship may be applied. The curve begins at the limiting frequency 
and in.many cases increases rather uniformly; one can say that the 
coelficient of utilisation of photons increases with photon energy. 
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However, in other cases, the spectral curves have well defined maxima 
which lie within a rather narrow spectral band. 

Photocells which utilise the extrinsic photoeffect have broad ap- 
plication. They are used in photo relays, television and cinema sound 
tracks. Silver, caesium and potassium may serve as photocathodes; 
antimony-caesium cathodes are widely employed. 

In various photo relay applications, the photocurrent need not be 
proportional to the intensity of light, but the sensitivity of the 
photocell should be high. In such cases, gas photocells instead of 
vacuum photocells may be used. This increases the sensitivity tens 
of times. 

Intrinsic Photoeffect. When the action of a photon results in the 
displacement of an electron from a filled band to a level of an im- 
purity or to a conduction level, it is referred to as the intrinsic pho- 
toeffect. Under the action of light, this phenomenon may produce con- 
duction electrons and holes in a body. Such conduction electrons and 
holes will occur in pairs. Strictly speaking, there will be a pair of 
charges for each photon. The phenomenon is extremely complicated 
by secondary processes which occur within a body as a consequence 
of the recombination of electrons and holes. 

It is clear, therefore, that the intrinsic photoeffect is a phenomenon 
which is particularly characteristic of semiconductors, but which 
may also occur in insulators. 

Semiconductors which possess this effect are included in current 
circuits as photo resistors. In the dark, such a body has very low 
(dark) conduction. Its conduction increases when subjected to light. 
Energies of several tenths of an electron-volt may be sufficient to 
produce intrinsic electron transitions. Therefore, the threshold of 
the intrinsic photoeffect may lie in the far infrared region. 

Photo resistors are widely used in signal systems and in automation 
when it is necessary to amplify or detect very small changes in light 
intensity. 


275. Barrier Layers 


In a number of cases, the junction between a metal and a semicon- 
ductor or between two semiconductors may have a rectifying action. 
Even though the boundary between two bodies in close contact (weld- 
ed or fused) is very narrow, nevertheless it is of finite width; hence 
the designation barrier layer. Such a layer may form at the junction 
between copper and cuprous oxide or at the junction between sele- 
nium and cadmium selenide. Investigations indicate that a barrier 
layer between two semiconductors is formed when one of the semi- 
conductors is a p-type conductor and the other an n-type. Such 
barrier layers are called p-n junctions. 
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Fig. 306 illustrates the rectification provided by barrier layers. 
The figure shows a typical current-voltage curve. The right branch 
of the curve is the characteristic for the forward current and the left 
branch for the reverse current. The forward current increases rapidly 
with increasing voltage, but the reverse current remains almost 
constant and has a very low value. 

Which is the direction of forward current? Investigations indi- 
cate that at a p-n junction the forward current flows from the p-type 
semiconductor, through the junction, to the n-type semiconductor. 
This means that holes move toward electrons and electrons move 
toward the higher concentration of 
holes. Reverse current flows when +] 
the holes and electrons move away 
from the junction. 

In the case of a metal-semicon- 
ductor junction, the situation may 
be described as follows. If the met- 
al forms a junction with a p-type 
semiconductor (copper-cuprous ox- 
ide), the forward current will flow 
from the cuprous oxide to the 
copper. This is to be expected: 


There areno free electrons inasemi- V ae 
conductor, but in a metal there -] 
are an excess of such electrons; Fig. 306 


therefore, electrons move from the 
metal to the semiconductor. 

The characteristics of barrier layers find wide application in indus- 
trial rectifiers. Copper-oxide rectifiers (copper-cuprous oxide) and 
selenium rectifiers have been produced for a long time. During re- 
cent years, tiny germanium and silicon rectifiers—crystal diodes— 
have been widely introduced. The introduction of impurities into 
germanium or silicon may transform them into p-type or n-type 
conductors, 

A crystal diode consists of a very small germanium (or silicon) 
crystal, one part of which contains an acceptor-type impurity and 
the other a donor-type impurity. 

Also of interest are crystal triodes, representing a semiconductor 
system of the p-n-p or n-p-n type. If a wire is soldered to each of the 
three sections of such a tiny ‘triode (the dimensions of crystal “tubes” 
are of the order of a centimetre), the system may be connected in a 
circuit just as an ordinary triode tube. A voltage is applied across 
the two outer ends, one end serving as the anode and the other as 
the cathode. The third tap serves as the grid. Such a system of semi- 
conductors has two barrier layers connected in opposition. This 
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is why it behaves like a triode tube. The analogous operation of 
crystal triodes and ordinary triodes is illustrated in Fig. 307. 

Another important use of barrier layers is in the manufacture 
of photocells, which operate without a voltage source. Coating a 
semiconductor with a thin layer of metal will produce a_ barrier 
layer. A layer of metal may be made so thin that light easily passes 
through it. When light passes through the metal, an intrinsic 
photoeffect is produced in the semiconductor. The presence of a 
barrier layer causes the liberated electrons to move in a definite 
direction. An electric current will flow when the circuit is 
closed. 

Copper oxide and selenium photocells are manufactured on the 
basis of this principle. Sulphur-thallium photocells, having a short- 
circuit current of the order of 10,000 microamperes per lumen, are 


Callector 





Semtconductor Vacuum triode 
triode 


Fig. 307 


at present being used in the Soviet Union. They have an efficiency 
of transformation of light energy into electrical energy of the order 
of 1 per cent. Here too silicon and germanium p-n barrier layers are 
of fundamental significance. They enable us to manufacture photo- 
cells with an efficiency of the order of 10 per cent. This new discovery 
has placed the utilisation of solar energy on a practical basis. 


276. Contact Potential 


When two metals or semiconductors are in contact, there arises 
a difference of potential betweén them. This difference is known as 
contact potential. To measure this difference, we must remove in- 
clusions, oxide films, etc., and make close contact between the sur- 
faces of two such bodies by soldering, welding or pregrinding. In 
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this manner, we may form a circuit that is broken in one place. 
Since all points of the body are at the same potential, the contact 
potential may be determined by measuring the field in the gap. The 
order of magnitude of the contact potential between two bodies is 
equal to several tenths of a volt. 

The existence of contact potential is easily explained. It is due 
to the difference in work functions of the two adjacent bodies. It 
should be recalled that the work function A is equal to the difference 
between the energy W of an electron at the highest level inside a 
metal (at absolute zero) and the energy of an electron escaping from 
the metal with zero kinetic energy. 

The energy distribution of electrons in the case of two solids 
placed in contact can be represented as shown in Fig. 308. The upper 
level is the same for all bodies. The electrons of the body having a 
lower work function are at a higher 
energy level. Thus, the conditions 
are present for the transition of A, 
electrons from the first body to the “~7~~~"7-~7 
second. This transition is accom- 
panied by the formation of positive 
charge on the first body and nega- Fig. 308 
tive charge on the second. At the 
point of contact, there arises an electric field which impedes the tran- 
sition of electrons. Finally, equilibrium will be established at a 
panes potential difference characteristic of the given pair of 
metals. 

This picture depends little on temperature. As the temperature is 
raised, the energy distribution boundary of the electrons is no longer 
so sharply defined. Electrons appear at higher levels, but the condi- 
tions for the transition of electrons remain basically the same thanks 
to a lack of close dependence of the energy of the electrons on tem- 
perature. 

It is evident from the above description of this phenomenon that 
any group of solids may be arranged in a definite sequence such that 
each member of the sequence becomes positively charged with re- 
spect to the next. Such a series was first obtained by Volta, the dis- 
coverer of contact potential. From the above explanation of this 
phenomenon, it is clear that a Volta series corresponds to rising 
work functions, i.e., the motion of electrons between two bodies 
is in the direction of the one having a higher work function. . 

Since the contact potential @,, between two bodies may be ex- 
pressed in the form of a difference of work functions, 





1 
P12 Tp. (A, —A,), 
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it is evident that the potential difference between: two bodies may 
be expressed as the difference of the contact potentials between each 
of these bodies and a third: 


1 1 

Pie rege (A, —A,), Gis = ‘e. (A, Ay); 
1 

Pes are (A, —A,) = Pa Pai- 


Furthermore, it is evident that in a closed circuit consisting of 
any number of bodies the total contact potential is equal to zero: 
12+ 23+9,, =0. No current will flow in such a circuit. 


Electromotive Series 


(normal potential in an electrolyte solution, volts) 


Li Ca Na Al Zn Fe Ni Ph Cu Hg Ag _ Pt 
3,01 —2.84 —2.71 —1.66 —0.76 —0. 44 —0.23 —0. 12 +-0.34 +0.70 +0.80+1.2 


277. Charge Distribution in a Nonuniformly 
Heated Body 


Let us consider a rod along which a drop in temperature occurs, 
Different portions of the rod will be subject to different conditions, 
and this will affect the behaviour of the free electric charges. Where 
the temperature is higher the charges will have greater energy; 
moreover, the number of free charges may increase if electrons can 
pass from the filled band to the conduction band. Both effects tend 
to produce diffusion of free charges, which continues until a field 
which counterbalances the tendency to uniform distribution is creat- 
ed. A drop in potential will occur along the rod; negative charge is 
formed at one end of the rod and positive charge at the other. Each 
body has its own characteristic curve of potential drop as a function 
of temperature. The rate of fall of potential may be described by the 
derivative of the potential with respect to temperature: 

dp 
a =a: 

If a constant temperature difference is maintained between the 
ends of a rod, heat is transferred continuously through the rod. 
Heat is transferred by free charges, but current cannot flow in an 
open circuit. The continuous transfer of energy without the transfer 
of charge is achieved thanks to the different velocity of the charges 
moving from the hot end to the cold end, while the number of charges 
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passing through a given cross-section per unit time is the same in 
each direction. If electrons are the carriers of current, an excess con- 
centration of them is produced at the cold end of the rod. If positive 
particles or holes are the carriers of current, positive charge accumu- 
lates at the cold end. Thus, the sign of the potential difference will 
differ, depending on the sign of the current carriers. 

Will the above effect obtain in the case of a semiconductor, partic- 
ularly one in which holes as well as electrons are the carriers of 
current? As a matter of fact, such bilateral diffusion may reduce to 
zero the potential difference of a nonuniformly heated body. However, 
the potential differences formed by positive and negative current 
carriers may not balance each other. This may occur as a result of 
a difference in mobility between electrons and holes, and also as a 
result of differences in their concentrations. 

Certain difficulties occur in detecting potential differences in a 
nonuniformly heated conductor. The order of magnitude of such a 
potential drop is 10~* v/deg. This effect cannot be detected, of course, 
by forming a closed circuit of the conductor in the hope of measuring 
electric current. Such a closed circuit may be conceived of as divided 
into two halves: in one half a potential drop occurs and in the other 
a potential rise. In a uniform conductor, the magnitudes of these 
two potentials will be exactly equal; hence the emf we wish to measure 
will not be detected. 


278. Thermoelectromotive Force 


Electric current will flow in a wire ring consisting of two (or more) 
different materials if the junctions have different temperatures. 
This is the well-known thermoelectric effect, which has found broad 
practical application. 

There are two possible reasons for the flow of a thermoelectric 
current. First, it is evident that the potential drops along the two 
wires due to temperature drop may differ if the values of the constant 


a =# differ for the two materials (we shall designate them as I 


-and IT). Ta Ty, 
Thus, the potential differences \ aT and \ dT generally 


Ty, T 
are not equal. This alone would be sufficient for an emf equal to the 
difference between these voltages to arise in the wire ring. 

The second reason for thermoelectric current lies in the fact that 
contact potential quite probably depends on temperature. If the 
two junctions are placed at different temperatures, their contact 
potentials may differ. Again, this condition alone would be sufficient 
for a net potential difference to exist in the closed circuit and, hence, 
for a current to flow. 
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Taking both phenomena into account, we may express the ther- 
moelectromotive force as the sum of the voltage drop in the first 
wire, the jump in potential from the first wire to the second, the po- 
tential drop in the second wire, and the jump in potential from the 
second wire to the starting point of the circuit: 


Ta T 
é= J adT + [9 (T.)— 9 (7,)] +) a,dT + [9 (T,)— Oy (7,))- 
i T2 


To simplify the above expression, let us write the difference 
Gy (T2)—G(7,) in the form 


oe 
Pu 
(a dT 
T 


and the analogous second difference in a similar form. Now, the for- 
mula for the emf assumes the form 


=f (0G) arf (oS ar 
Ty TY, 


Thus, we have succeeded in expressing @ in the form of a difference 
between two quantities, each of which is characteristic of a given 
body. Quite often the term “ihermoelectromotive force” is used to 
refer to the emf per degree: 

dp 


Ee ay ‘ 


rather than to the above integral. This quantity is a fundamental 
characteristic of the thermoelectric properties of a body. It is not 
an invariable constant, for it may depend on thermodynamic condi- 
tions, including the temperature. However, for many bodies this 
dependence is not well defined. 

By measuring the thermoelectromotive force, we may determine 
the difference between the above quantities, but we cannot determine 
a. However, by forming different pairs of conductors and semicon- 
ductors, we are able to determine the value of a relative to a material 
taken as a “base”. Thus, materials may be arranged in a-series in 
accordance with their thermoelectromotive forces. For reasons which 
are quite understandable in view of what has been said, a thermoelec- 
tromotive force series does not coincide with the corresponding 
contact potential series, 
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Let us list the thermoelectromotive forces of several metals with respect to 
platinum. If a given metal is joined to platinum and one junction is held at 
0°C while the other is held at 100°C, an emf arises in the closed circuit: 


Antimony +4.0 mv 
9 


Iron +1 3 
Copper +0.75 ,, 
Nickel —15 ,, 


Constantan —3.4_,, 


The positive sign indicates that at the 0°-junction current flows from the given 
metal to the platinum. 


Using a table of values of the constant a, one can calculate the ther- 
moelectromotive force occurring for a given temperature difference 
from the expression @=(a,—a,)(T,—T,). This is the expression 
for a thermal element consisting of two metals or two semiconduc- 
tors the current carriers of which have the same sign. In this case, 
the potential differences arising in the two branches of the circuit are 
in phase opposition, and the resulting emf is equal to the difference 
between the effects of the two conductors forming the circuit. How- 
ever, the picture changes when the circuit is formed of two semi- 
conductors, one of which possesses hole conductivity and the other 
electron conductivity. In a p-type conductor, holes move toward the 
cold junction and electrons toward the hot junction. In an n-type 
conductor, electrons move toward the cold junction. The two effects 
reinforce each other and the formula assumes the form 


oy aes (a, +,) Cs cad T,). 


This fact is of great practical significance. 


279. Liberation of Heat in Electrical Circuits 


Joule heat is liberated in a conductor in which current flows. 
The displacement of charges along a body is accompanied by two 
other thermal effects. 

The first of these, the Peltier effect, consists in the following. If 
electric current passes through a junction between two bodies, heat 
proportional to the current strength is released or absorbed at the 


junction: 
Q=), 


» 
where II is a proportionality constant. A remarkable feature of this 
effect is that the sign and magnitude of the heat changes when the 
direction of the electric current changes, i.e., depending on this di- 
rection, a particular junction will release or absorb heat. This 
was demonstrated by Lenz more than a century ago. A drop of water 
was placed in a recess at the junction between an antimony rod and 
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a bismuth rod. Then, by passing current in one direction, he showed 
that the drop freezes. When the current was reversed, the drop melted. 

The second effect occurs in any uniform conductor which is heated 
nonuniformly. Assume, for example, current flows along a rod, one 
end of which is maintained at one temperature and the other end at 
another. In such a conductor additional heat, proportional to the 
first power of the current strength (not to the second power as in the 
case of the Joule effect), will be liberated. To detect this effect, we 
must, of course, reduce Joule heat to a minimum. 

This effect, which was predicted by the British physicist Thomson 
on the basis of thermodynamic considerations, may be demonstrated 
in the following manner. Included in the current circuit are two bars 
which are placed parallel to each other as shown in Fig. 309. The 
ends of these bars, maintained in pairs, are held at different temper- 
atures. It would seem, in view of the symmetry of the arrangement, 
that symmetrical points of the bars should have the same temperature. 

Cold However, in one bar the current flows from 

e the hot end to the cold end and in the other 

from the cold end to the hot. Owing to the 

Thomson effect, corresponding points of the 

two bars do not have the same temperature. A 

point of the bar in which the current flows 

from the hot to the cold end will be hotter 

| than the corresponding point of the other bar. 

The quantity of heat liberated per second 

H C in a segment of length dx may be written in 
the form 


OT 
dQ=tl ar dx, 


where t is a proportionality constant. The 

greater the temperature gradient, the greater 

the quantity of heat. Three effects exist si- 

Hot multaneously in a thermoelectrical circuit: 

the appearance of a thermoelectromotive force, 

the Peltier effect and the Thomson effect. 

On the basis of the principles of thermody- 

Fig. 309 namics, it can be proved that these three proc- 

esses are interconnected. This requires no 

proof for weak currents. Since the thermoelectric effects are pro- 

portional to the first power of the current and Joule heat to the second, 
the Joule heat is negligible in such cases, 


Example. The ends of arod of sodium (t=—8.5X 107° v/deg and e=5x 107° 
ohm cm) of 10-cm length and 5-mm? cross-section are maintained at tempera- 
tures of 300°K and 310°K. 
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When a current /=0.5 ma flows from the hot to the cold end of the rod, the 
heat liberated in the conductor per unit time due to the Thomson effect is 
Qr=t SE 1=— 8.5% 1078 (—5 x 10-4) x 1x 10= 4.25 108 J/sec. 


The minus sign preceding the current indicates that the current flows in the di- 
rection of decreasing temperature. The heat liberated in the conductor per unit 
time due to the Joule effect is 


Qy=PR=(5x 10-4)? x5 xX 107§ — 9, =2.5x 107 J/sec, 





5x10? 
i.e., about sao of the value of the Thomson heat. 
Thermodynamic analysis shows that the coefficients a, II and 
2 Il 7 
vw are interrelated as follows: tw and =a. Substitut- 


ing aT for II in the first relation, we obtain: t=T%.. The ab- 


solute value of a may be determined from these equations. 

The Peltier and Thomson effects have the same physical basis as 
thermoelectromotive force. In the final analysis, a thermoelectromo- 
tive force arises from the fact that heat flow transfers electric charges. 
Here, however, we are dealing with phenomena in which a flow of 
electric charges transfers heat. 
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The opportunities for the application of thermocouples as generators 
of electrical energy have increased considerably in recent times. 
Metal thermocouples have a coefficient of efficiency of the order of 
0.5 per cent, but that of a semiconductor thermocouple consisting 
of a hole segment and an electron segment has already reached as 
much as 7-8 per cent. The low efficiency results from irreversible 
losses in the form of Joule heat. If R, is the resistance of the internal 
portion of the circuit andR that of the external circuit, the power 
delivered to the external resistance (useful power) will equal Roe 
for any electrical circuit; here, € is the emf. Substituting the value 
of the thermoelectromotive force, we obtain for the power of a ther- 
mocouple the expression 


2 2 R 
a y=) (R+R,)?° 

The electromotive force of a thermocouple is of the order of several 
tenths of a volt. To obtain a voltage of 120 v, for example, therme- 
coup!es are connected in series like a battery. If heavy currents are 
required, thermocouples must be connected in parallel. 

Another important application of the thermoeffect, which has 
also become possible as the result of the development of semicon- 


ductor engineering, is the employment of semiconductors as a re- 
frigerator. 
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The application of the thermoelectric effect to the measurement 
of temperature is well known and need not be discussed. 

An important and long known field of application of the thermo- 
eflect is in the detection of very small amounts of heat. The opportu- 
nities in this field have increased still further as a result of the fact 
that semiconductors yield large thermoelectromotive forces. For 
such purposes, thermocouples connected in series—so-called ther- 
mopiles—are used. Every other junction of a thermopile is cooled 
and the alternate ones are heated. Thermopiles are used to measure 
power levels as low as several ergs per second. However, it is possible 
to lower this limit still further, i.e., to several tenths of an erg per 
second. This is achieved by means of vacuum thermocouples, the 
thermal losses of which are reduced to a minimum. 
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